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1. TWTROPnCTION 

The present invention relates to methods and 
compositions for the diagnosis, prevention and treatment of 
tumor progression in mammals, for example, humans. The 
different types of tumors may include, but are not limited 
to, human melanomas, breast, gastrointestinal tumors such as 
esophageal, stomach, duodenal, colon, colorectal and rectal 
cancers, prostate, bladder, testicular, ovarian, uterine, 
cervical, brain, lung, bronchial, larynx, pharynx, liver, 
pancreatic, thyroid, bone, various types of skin cancers and 
neoplastic conditions such as leuXemias and lymphomas. 
Specifically, genes which are differentially expressed in 
tumor cells relative to normal cells and/or relative to 
tumor cells at a different stage of tumor progression are 
identified. For example, genes are identified which are 
differentially expressed in benign (etq. , non-malignant) 
tumor cells relative to malignant tumor cells exhibiting a 
high metastatic potential. Genes are also identified via 
the ability of their gene products to interact with gene 
products involved in the progression to and/or 
aggressiveness of neoplastic tumor disease states. The 
genes identified can be used diagnostically or as targets 
for therapeutic intervention. In this regard, the present 
invention provides methods for the identification of 
compounds useful in the d^nosis, prevention and 
therapeutic treatoent_of_tumor progression , including,., for 
exampl^TSetastatic neoplastic -di-sordecs.^ The present 
invention also provides methods for the identification of 
compounds useful in the diagnosis, prevention and 



therapeutic treatment of tumor progression, including, for 
example, pre-neoplastic and/or benign states. Additionally, 
methods are provided for the diagnostic evaluation and 
prognosis of conditions involving tumor progression, for the 
identification of subjects exhibiting a predisposition to 
such conditions, for monitoring patients undergoing clinical 
evaluation for the prevention and treatment of tumor 
progression disorders, and for monitoring the efficacy of 
compounds used in clinical trials. 

2. BACKGRODND OP THE INVEMTIOM 

Cancer is the second leading cause of death in the 
United States, after heart disease (Boring, C.C. et al., 
1993, CA Cancer J. Clin. 41:7), and develops in one in three 
Americans, and one of every four Americans dies of cancer. 
Cancer is characterized primarily by an increase in the 
number of abnormal, or neoplastic, cells derived from a 
given normal tissue which proliferate to form a tumor mass, 
the invasion of adjacent tissues by these neoplastic tumor 
cells, and the generation of malignant cells which spread 
via the blood or lymphatic system to regional lymph nodes 
and to distant sites. The latter progression to malignancy 
is referred to as metastasis. 

Cancer can be viewed as a breakdown in the 
communication between tumor cells and their environment, 
including their normal neighboring cells. Signals, both 
growth-stimulatory and growth-inhibitory, are routinely 
exchanged between cells within a tissue. Normally, cells do 
not divide in the absence of stimulatory signals, and, 
likewise, will cease dividing in the presence of inhibitory 
signals. In a cancerous, or neoplastic, state, a cell 
acquires the ability to "override" these signals and to 
proliferate under conditions in which normal cells would not 
grow. 
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Tumor cells must acquire a number of distinct 
aberrant traits to proliferate. Reflecting this requirement 
is the fact that the genomes of certain well-studied tumors 
carry several different independently altered genes, 
including activated oncogenes and inactivated tumor 
suppressor genes. Each of these genetic changes appears to 
be responsible for imparting some of the traits that, in 
aggregate, represent the full neoplastic phenotype (Land, H. 
et al., 1983, science 222:771; Ruley, H.E., 1983, Nature 
304:602; Hunter, T., 1991, Cell 64:249). 

in addition to unhindered cell proliferation, cells 
must acquire several traits for tumor progression to occur. 
For example, early on in tumor progression, cells must evade 
the host immune system. Further, as tumor mass increases, 
15 the tumor must acquire vasculature to supply nourishment and 
remove metabolic waste. Additionally, cells must acquire an 
ability to invade adjacent tissue, and, ultimately, cells 
often acquire the capacity to metastasize to distant sites. 

The biochemical basis for immune recognition of 
tumor cells is unclear. It is possible that the 
tumorigenicity .of cells can increase when the cells' display 
of Class I histocompatability antigens is reduced (Schrier, 
P.I. et al., 1983, Nature 1^5:771), in that these antigens, 
in conjunction with tumor-specific antigens are required for 
25 the tumor cells to be recognized by cytotoxic T lymphocytes 
(CTLs). Tumor cells which have lost one or more genes 
encoding tumor-specific antigens seem to escape recognition 
by the corresponding reactive CTLs (Van der Bruggen, P. et 
al., 1991, Science 254: 1643) . 

Once a tumor reaches more than about 1 mm in 
diameter, it can no longer rely on passive diffusion for 
nutrition and removal of metabolic waste. At this point, 
the tumor mass must make intimate contact with the 
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Circulatory system. Thus, cells within more advanc d tumors 
secrete angiogenic factors which promote neovascularization, 
i.e. . the growth of blood vessels from surrounding tissue 
into'the tumor mass (Folkman, J. and Klagsburn, M. , 1987, 
science 131:442; Liotta, L.A. et al., 1991, Cell M:327). 
Among these angiogenic factors are the fibroblast growth 
factor (FGF) and endothelial cell growth factor (ECGF) . 
Neovascularization can, in fact, be an essential precursor 
to metastasis. First, the process is required for a large 
increase in tumor cell number, which in turn, allows the 
appearance of rare metastatic variants. Further, 
neovascularization provides a direct portal entry into the 
circulatory system which can be used by metastasizing cells. 

A variety of biochemical factors have been 
associated with different phases of metastases. Cell 
surface receptors for collagen, glycoproteins such as 
laminin, or proteoglycans, facilitate tumor cell attachment, 
an important step in invasion and metastases. Attachment 
then triggers the release of degradative enzymes which 
facilitate the penetration of tumor cells through tissue 
barriers. Once the tumor cell has entered the target 
tissue, specific growth factors are required for further 

proliferation. 

It is apparent that the complex process of tumor 
25 progression must involve multiple gene products. It is 
therefore important to define the role of specific genes 
involved in tumor progression, to identify those gene 
products involved in the tumor progression process and to 
further identify those gene products which can serve as 
therapeutic targets for the diagnosis, prevention and 
treatment of metastases of various forms of cancers. 

some attempts have been made to study genes which 
are thought to elicit or augment tumor progression 



20 



30 



- 4 - 



phenotypes* Mutations may drive a wave of cellular 
multiplication associated with gradual increases in tumor 
size, disorganization and malignancy. For example, a 
mutation in the tumor suppressor gene which is a negative 
regulator of cellular proliferation, results in a loss of 
crucial control over tumor growth and progression. 
Differential expression of the following suppressor genes 
has been demonstrated in human cancers: the retinoblastoma 
gene, RB; the Wilms' tumor gene, WTl (lip); the gene deleted 
in colon carcinoma, DCC (18q); the neurofibromatosis type 1 
gene, NFl (17q); and the gene involved in familial 
adenomatous polyposis coli, APC (5q) (Vogelstein, B. and 
Kinzler, K.W. , 1993, Trends Genet. 9:138-141). 

Insight into the complex events that lead from 
normal cellular growth to neoplasia, invasion and metastasis 
is crucial for the development of effective diagnostic and 
therapeutic strategies. The foregoing studies are aimed at 
defining the role of particular gene products presumed to be 
involved in tumor progression. However, such approaches 
cannot identify the full panoply of gene products that are 
involved in the cascade of steps in tumor progression. A 
great need, therefore, exists for the successful 
identification of those genes which are differentially 
expressed in cells involved in or predisposed to a tumor 
progression phenotype. Such differentially expressed gene 
and/ or gene products can represent useful diagnostic markers 
and/or therapeutic targets for tumor progression disorders. 
With respect to diagnostic techniques, such genes and/or 
gene products could represent useful markers for the 
diagnosis, especially early diagnosis, given the correlation 
between early diagnosis and successful cancer treatment. 
With respect to therapeutic treatments, such differentially 
expressed genes and/ or gene products could represent useful 
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involving metastasis, to which such gene products can 
contribut , are described. 

The present invention also relates to methods for 
prognostic and diagnostic evaluation of tumor progression 
conditions, and for the identification of subjects 
containing cells predisposed to such conditions. 
Furthermore, the invention provides methods for evaluating 
the efficacy of therapies for disorders involving tumor 
progression, and for monitoring the progress of patients 
participating in clinical trials for the treatment of such 
diseases. 

The tumor progression disorders described herein can 
include disorders involved in the progression of such human 
cancers as, for example, human melanomas, breast, 
gastrointestinal, such as esophageal, stomach, colon, bowel, 
colorectal and rectal cancers, prostate, bladder, 
testicular, ovarian, uterine, cervical, brain, lung, 
bronchial, larynx, pharynx, liver, pancreatic, thyroid, 
bone, leukemias, lymphomas, and various types of skin 
cancers . 

The invention also provides methods for the 
identification of compounds that modulate the expression of 
genes or the activity of gene products involved in tumor 
progression, including the progression of metastatic 
neoplastic diseases, as well as methods for the treatment of 
such diseases. Such methods can, for example, involve the 
administration of such compounds to individuals exhibiting 
symptoms or markers of tumor progression, such as markers 
for metastatic neoplastic diseases. 

This invention is based, in part on systematic 
search strategies involving in vix2 and in vit£2 paradigms 
of tumor progression, including the progression to 
metastatic disease, coupled with sensitive and high 
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throughput gene xpression assays, to id ntify genes 
tmrouij f 1 ^„„„, - 113 relative to normal 

differentially expressed in tumor o 11= re 
cells and/or relative to tumor cells at a dxfferent tumor 
^^ogresslon sta,e. In contrast to approaches that merely 
Lluate the expression of a given ,ene product P---^^ " 
Blay a role in one or another of the various stages of tumor 
progression, such as, for example the progression " - 
LIstatic disease process, the search strategies and assays 
used herein permit the identification of all "^^^^^ 
Known or novel, which are differentially expressed in tumor 
cells relative to normal cells or relative to tumor cells at 
a different stage of tumor progression. 

This comprehensive approach and evaluation permits 
the discovery of novel genes and gene products, as well as 
the identification of an array of genes and gene products 
'whether novel or Known, involved in novel pathways that 
Play a major role in the disease pathology. Thus, the 
present invention maKes possible the "--"^"^""/^"^^^^^ 
Characterization of targets useful for P"^"-"' 
monitoring, rational drug design, and/or other *>^erapeutic 
intervention of tumor progression disorders, including 
disorders involving metastasis. 

The Example presented in Section 6, below, 
demonstrates the successful use of tumor progression search 
strategies of the invention to identify genes which are 
differentially expressed within tumor cells relative to 
tumor cells at a different stage of tumor progression, 
specif ioally, the Example identifies a gene ""ich is 
differentially expressed in metastatic cell populations 
relative to benign, non-malignant tumor cells. 

This gene, referred to herein as the 030 gene 
(,omy030 in the mouse and fohy030 in ' ..^^i"°ri-„. 
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metastatic tumor cells relative to its expressj^n_ in 
miiifetlc-CTSr^^nT.' The gene appears iri mice and has 
the CDNA sequence shown in Fig. 3 A and 3B (SEQ ID NO: 2) . A 
homologous gene, referred to herein as the fohy030 gene, 
5 appears in humans and has the cDNA sequence shown in Fig. 5 
(SEQ ID NO: 6). An alternative splice form of the human cDNA 
has the sequence shown in FIG. 6 (SEQ ID N0:8). Unless 
stated expressly otherwise, any general reference to the 030 
gene hereinafter refers to both the murine (fomy030) and 
10 human (fohy030) homologs of this gene. 

The identification of the 030 gene and the 
characterization of its expression in particular stages of 
metastatic spread provides, therefore, newly identified 
targets for the diagnosis, prevention, and treatment of 
tumor progression disorders, including metastatic neoplastic 
diseases. 

Its expression pattern indicates that the 030 gene 
product acts to inhibit tumor progression. For example, a 
reduction in the level of 030 gene expression correlates 
with an increase in a cell's metastatic potential i^, a 
reduction of 030 gene product in tumor cells can induce or 
predispose a cell to progress to a metastatic state. 

Hence, any method which can bring about an increase 
in the amount of 030 gene product can inhibit or slow the 
progression to metastasis. In fact, it is possible that the 
030 gene product exhibits general tumor inhibition 
properties . 

A cDNA clone of the murine homolog, designated 
fomy030, is described herein in FIGS. 3A and 3B (SEQ ID 
N0:2) (nucleotide sequence and amino acid sequence), and was 
derived from fomy030 mRNA. However, as used herein, fomy030 
CDNA refers to any DNA sequence that encodes the amino acid 
sequence depicted in FIGS. 3A and 3B (SEQ ID N0:3). 
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referred to herein as belonging to different "tumor 

progression stages." 

A "disorder involving tumor progression" or a "tumor 
progression disorder," as used herein, r fers to the state 
of a cell or cells which have undergone or are in the 
process of undergoing a tumor progression event, as defined 
above . 

"Differential expression," as used herein, refers to 
both quantitative, as well as qualitative differences in the 
genes' temporal and/or cellular expression patterns among, 
for example, normal and neoplastic tumor cells, and/or among 
tumor cells which have undergone different tumor progression 
events. Differentially expressed genes can represent 
"fingerprint genes," and/or "target genes." 

"Fingerprint gene," as used herein, refers to a 
differentially expressed gene whose expression pattern can 
be utilized as part of a prognostic or diagnostic marker for 
the evaluation of tumor progression, or which, 
alternatively, can be used in methods for identifying 
compounds useful for the treatment of tumor progression. 
For example, the effect of the compound on the fingerprint 
gene expression normally displayed in connection with tumor 
progression can be used to evaluate the efficacy of the 
compound as a treatment for tumor progression, or can, 
additionally, be used to monitor patients undergoing 
clinical evaluation for the treatment of tumor progression. 

"Fingerprint pattern," as used herein, refers to the 
pattern generated when the expression pattern of a series 
(which can range from two up to all the fingerprint genes 
which exist for a given state) of fingerprint genes is 
determined. A fingerprint pattern can be used in the same 
diagnostic, prognostic and compound identification methods 
as the expression of a single fingerprint gene. 
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regulation of the 030 gene^ T - ^J^^ ^ ^/^^ 
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cell cultures was ^V--^^^^^^^^ (See materials 
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H7 and B16 H8. ^ ^^^^^^ „ „.eletide and 

aeduced a.ino acid sec^encesV cCNA clone of fohy030 (SEQ 
?D nL-6 cnucleotide sequence^nd 7 (amino ac.d se=p.ence„ . 
FIG 6 is->^mparison of the nucletide and deduced 

acid ;e^encTrS?^n^ii«^R5:i4^- °' '""^"^ 
Z tos:. (nucleotide .e,ue;;«r^STT>.ino acid se^ence,, . 

in FIGS. 3 A an\3B, the nucleotide sequence is 
nuM^etd starting at th\first nucleotide, ""ereas in FIGS^ 
5 and 6, the nucleotide i^ence is nu^red starting at the 
ATG start codon. 

5. r^ TT i "'" "MCBTPTTow OT TUB i^mnion 

Methods and compositions for the prevention, 
treatment and diagnosis of tumor progression, i"''!"*-^ 
t^or progression involving metastatic disorders, xn cells 
tZlved in human tumors. Such human tumors may inc uae, 
for example, human melanomas, breast, gastrointestinal 
tumors such as esophageal, stomach, duodenal, colon, 
colorectal and rectal cancers, prostate, 
testicular, ovarian, uterine, cervical, brain, lung 
bronchial, larynx, pharynx, liver, ^^^^^i^^, 
bone, various types of sKln cancers and other 
cond tlons such as leu3cemias, lymphomas. 

based, m part, on the evaluation and «=<P";"-^^* ^^^^s 

all genes that are differentially expressed in '^"""■^ 

: ative to normal cells and/or relative to t-or ce Us at a 

different stage of tumor progression. This permits the 

d flnltil of disease pathways and Identification of targets 

fn such pathways that are useful for diagnosis, prevention 

Tnd treatment of tumor progression, including the tumor 
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Methods for the cf gene 

modulate the expression of '^"^^ described in 

products involved in tumor P-^"^"/""^/ ^f 
lotion 5.3. Methods for monit «n, the^e^^^^ 
5 compounds during clinical tr ^^^^^ 

5.3.5. Additionally f^^2rs, including 

treatment of tumor progression disorder 

metastatic diseases. prognostic 

Mso discussed, below, ar ^ disorders 

>0 and diagnostic evaluation of ^^J^Z;^,^,,^ .^orders, 
Trt^rfortrirtUiratlon of subjects exhibiting 
Tpredlsposltlon to such disorders. 



5.1. IDBMTIPICATIOH OP DIPPEREMTIALLY 

PTPRBSflBP QBNEB . 

Described herein are methods for the identification 
of differentially expressed genes which are involved in 
5 tumor progression. There exist a number of levels or stages 
at which the differential expression of such genes can be 
exhibited. For example, differential expression can occur 
in tumor cells relative to normal cells, or in tumor cells 
within different stages of tumor progression. For example, 
10 genes can be identified which are differentially expressed 
in pre-neoplastic versus neoplastic cells. Such genes can 
include, for example, ones which promote unhindered cell 
proliferation or tumor cell invasion of adjacent tissue, 
both of which are viewed as hallmarks of the neoplastic 
15 state. Further, differential expression can occur in benign 
r e.q. . non-malignant) tumor cells versus metastatic, 
malignant tumor cells. Still further, differential 
expression can occur among cells within any one of these 
states (e^, pre-neoplastic, neoplastic and metastatic), 
20 and can indicate, for example, a difference in tumor 

progression severity or aggressiveness of one cell relative 
to that of another cell within the same state. 

Methods for the identification of such 
differentially expressed genes are described, below, in 
25 section 5.1.1. Methods for the further characterization of 
such differentially expressed genes, and for their 
categorization as target and/or fingerprint genes, are 
presented, below, in Section 5.3. 

"Differential expression" as used herein refers to 
30 both quantitative, as well as qualitative differences in the 
genes' temporal and/or tissue expression patterns. Thus, a 
differentially expressed gene can qualitatively have its 
expression activated or completely inactivated in, for 
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. normal versus tumor progression states, in cells 
xample, normal versus ^^.^tes or among cells 

Within different tumor P-^^^^^^/^^^^^; J^,. . s!ch a 
wi-in a single ^^ss.on 
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pattern withm a given s detectable in both 
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states being <^^2^^l\^^^^^^ ,s detectable via the 

to an RNA expression level wnx «m f reverse 

r=r:r"-»^ ...... 

'"''"".Xtematively. a ai«erentiaUv expressed ,e„e can 

an expression level ^^"^^J^ J^^. .^o. 

^ire:-: a.on, ce.. -.nin a sin,. ,iven 

atreVtc wnich expression ai«e« need onXv ^ 
The aegr standard characterization 

,..,e enough to . visu l ed ^^„^^^„,,,, 

techniques such ' „eXl-Kno«n 
"i r-^^^^^^^^^^ expression di«erences 

quantitative BT (reve techniques. 
Northern analyses and BNase proi further 
Oif ferentially expressed qenes can be father 
aescrihed as target genes and/ ornngerprint genes. 
..nngerprlnt gene," as used herein refers to 

*»vnressed qene whose expression 
aitferentially expressed ^ diagnostic marXer in 

5 b. utilized as part of a pr g alternatively, .ay 

Trererti:: treat.ent of tu.or progression and tu.or 
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progression disorders, including metastatic disorders, 
fingerprint gene can also have the characteristics of a 
target gene or a pathway gene (see below, in Section 5.2). 

"Fingerprint pattern," as used herein, refers to the 
5 pattern generated when the expression pattern of a series 
(which can range from two up to all the fingerprint genes 
which exist for a given state) of fingerprint genes is 
determined. A fingerprint pattern can be used in the same 
diagnostic, prognostic and compound identification methods 

10 as the expression of a single fingerprint gene. 

"Target gene," as used herein, refers to a 
differentially expressed gene involved in tumor progression 
in a manner by which modulation of the level of target gene 
expression or of target gene product activity can act to 

15 prevent and/or ameliorate symptoms of disorders involving 

tumor progression. Tumor progression disorders include, for 
example, disorders involved in human tumors, including, but 
not limited to human melanomas, breast, gastrointestinal, 
such as esophageal, stomach, colon, bowel, colorectal and 

20 rectal cancers, prostate, bladder, testicular, ovarian, 

uterine, cervical, brain, lung, bronchial, larynx, pharynx, 
liver, pancreatic, thyroid, bone, leukemias, lymphomas and 
various types of skin cancers. A target gene can also have 
the characteristics of a fingerprint gene and/or a pathway 

25 gene (as described, below, in Section 5.2). 

5.1.1. METHODS POR THE IDEMTIPICATIOH OP 
nTPFBREMTI^ TrT.Y BXPRE88ED QBMgg 

A variety of methods can be utilized for the 

identification of genes which are involved in tumor 
30 progression. Described in Section 5.1.1.1 are experimental 
paradigms which can be utilized for the generation of 
samples which can be used for the identification of such 
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gen s. Material gener differentially expressed 

ri:irr.=r:.r:'. ............ 

_.».ni<.mi TOR TBI IDEBTIPICATIOH 

paradigms which represent models of tumor 
slon stTtes are described herein. These paradigms 
rruriU::: .r the -tin=ation o. . . Which are 
.i«erentiaii, expres- . norm . «Us;^^^_„^ _ 
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matched cells (a^. c^l^^ ° individual or 
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""'"'TLHo iifferentri expression. Methods for the 
and analyzed ^« described, below, in 

analysis of paradigm material at 

once 'a' 'particular gene has been identified through 
the use of one paradigm, its expression pattern can be 
the use oi on F examDle. by studying its 

further characterized, for example, y 
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S example, be regulated ,„ , ,i^,„ paradigm, 
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distinguishing the roles and relative imp 

0 particular genes in tumor ,,,erred to 

Xn one embodiment of such a paradigm. ^ 

herein as the "in-yiSia" paradigm, cell 
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Characteristics of 
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tumor progression state, whi 

cellular phenotype. utilised herein can exhibit 

The variant cell ^.^,.tics as, for example, a 
0 such tumor progression charaoter ^^^^^^ 

high or .ow metastatic ^^^I'^J ^^^^ „ , distant site 
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:riir.tion or s^^ 

in accordance with «P appropriate 

cell line variants are ^^^^^^^^^ J, is isolated and 
30 conditions, the genes, as described in 

analysed for ^"'"-"^^^.X. 
detail m section 5.1.1.2. 
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Examples of cell lines that can be used as part of 
such in vitro paradigms include but are not limited to 
variants of melanoma cell lines, such as, for example, the 
murine melanoma B16 Fl cell line which exhibits a low 
metastatic potential and the melanoma B16 FIO cell line 
which exhibits a high metastatic potential (Fidler, I. J., 
1973, Nature New Biol 242:148-149); human colon cell lines, 
such as, for example KM12c (tumor cell line with low 
metastatic potential) and the KM20L4 (tumor cell line with 
high metastatic potential; Morikawa K. , et al. , 1988, Cancer 
Research 41:1943-1948); prostatic tumor cell lines, such as, 
for example, DU 145 (non metastatic tumor cell line) and PC- 
3-M (high metastatic potential tumor cell line; Karmali, 
R.A. et al. . 1987, Anticancer Res. 7:1173-1180, and 
Koziowski, J.M. et al. . 1984, Cancer Research 14:3522-3529); 
and breast carcinoma tumor cell lines, such as, for example, 
MCF-7 (non metastatic tumor cell line) and MDA-MB-435 (high 
metastatic potential tumor cell line; Watts C.K. et at, , 
1994, Breast Cancer Res. Treat. 31:95-105 and Rose, D.P. et 
al. . 1993, J. Natl. Cancer Inst. 85:1743-1747). 

As presented in the Example presented in Section 6, 
below, this paradigm has been successfully utilized to 
identify a gene, referred to herein as the 030 gene, which 
is differentially expressed in cells exhibiting a high 
metastatic potential relative to cells exhibiting a low 
metastatic potential. Specifically, the 030 gene is 
expressed at a many-fold higher level in low metastatic 
potential cells relative to cells exhibiting a high 
metastatic potential. 

In a second paradigm, referred to herein as the in 
vivo paradigm, animal models of tumor progression disorders 
can be utilized to discover differentially expressed gene 
sequences. The in vivo nature of such tumor progression 



models can prove to be especially predictive of the 
analogous responses in living patients. 

A variety of tumor progression animal models can be 
used for as part of the in vivo paradigms. For example, 

5 animal models of tumor progression may be generated by 
passaging tumor cells in animals (e^, mice) , leading to 
the appearance of tumors within these animals. 

Additional animal models, some of which may exhibit 
differing tumor progression characteristics, may be 

10 generated from the original animal models described above. 
For example, the tumors which result in the original animals 
can be removed and grown ih vitro. Cells from these in 
vitro cultures can then be passaged in animals and tumors 
resulting from this passage can then be isolated. RNA from 

15 pre-passage cells, and cells isolated after one or more 
rounds of passage can then be isolated and analyzed for 
differential expression. The differential expression can be 
compared to the metastatic potential expression of such 
cells. These cells can now represent cells from different 

20 tumor progression states, or cells within a given tumor 

progression state exhibiting differing degrees of severity 
or aggressiveness. Such passaging techniques can utilizing 
any of the variant cell lines described, above, for the in 

vitro paradigms. 
25 Additionally, animal models for tumor progression 

which can be utilized for such an in viVP paradigm include 
any of the animal models described, below, in Section 5.7.1. 
Other models include transgenic mouse model for melanoma 
(Mintz, B. and Silvers, W.K. , 1993, Proc. Natl. Acad. Sci. 
USA 2fl:8817-8812), transgenic mice which carry specific 
adenomatous polyposis coli (APC) gene mutations (Fodde, R. , 
et al., 1994, proc. Natl. Acad. Sci. USA 91:8969-8973) and 
the transgenic mouse in which the mammary tumor virus LTR/c- 
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Bye gene is anomalously expressed (Leder, A., et al., 1986, 

cells lS:485-495) . 

A third paradigm, referred to herein as the 
..specimen paradigm," utilizes samples from surgical and 
biopsy specimens. Such specimens can represent normal 
tissue, primary, secondary or metastasized tumors obtained 
from patients having undergone surgical treatment for 
disorders involving tumor progression such as, for example, 
melanomas, colon carcinomas, lung carcinomas, prostatic 
cancers and breast cancers. 

surgical specimens can be procured under standard 
conditions involving freezing and storing in liquid nitrogen 
(see, for example, Karmali, R.A., et^, 1983, Br. J. 
cancer 4£:689-696.) RNA from specimen cells is isolated by, 
for example, differential centrifugation of homogenized 
tissue, and analyzed for differential expression relative to 
other specimen cells, preferably cells obtained from the 

same patient. 

in paradigms designed to identify genes which are 
involved in tumor progression, compounds knovm to have an 
ameliorative effect on the tumor progression symptoms can 
also be used in paradigms to detect differentially expressed 
genes. Such compounds can Include Known therapeutics as 
well as compounds that are not useful as therapeutics due to 
their harmful side effects. For example, tumor cells that 
are cultured as explained in this section, above, can be 
exposed to one of these compounds and analyzed for 
deferential gene expression with respect to untreated tumor 
cells, according to the methods described below in Section 
5.1.1.2. m principle, however, according to the paradigm, 
any cell type involved in tumor progression and disorders 
thereof can be treated by these compounds at any stage of 
the tumor progression process. 
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ceixs i-n ,^ „ fibroblasts) which have 

r-oTBoared to unrelated cells (e^, fibrooxas , 
compared t ^^^^ ^^^^ ^^^^^^^ 
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::,s described helow to identify novel therapeutic 
compounds for inhibition of tumor pro,ressxon and the 
tHatment of tumor progression disorders, xnclud.n, 
metastatic diseases. 

in order to identify differentially expressed genes, 
™a either total or mMA, can be isolated from cells 

"tK 1 rrt; r 1::= =0^: not 

« ai^i; - isolation of can be utili«d for the 

Tuiifica'tion Of such ^^^^^^ 

;.usubel, F.M. et^, eds., 1987-1993 '^rr 

John Wiley . Sons Inc New Yor. 

13 incorporated -"-J^j;::^^:^ s r^^^^^^ be 
, Additionally '"^';^:;Z,''Z^ Lose of sKiU in 
processed using techniques well isolation 
the art, such as, for example, the single st p 
tne art, patent No. 

process of ChomcynsKi, P. (1939. "-^ i„ its 

4,843,155), which is incorporated herein by 

° """%"ranscripts within the collected PNA samples which 
.epresent ^ produced by differentially expressed genes can 
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„atl. ^cad. ;:.:Le a^:149-153, Lee. S.W. 
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preferably. //'f ^,2 311, "Wch 1"=-^°""^ 
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CMoh sinale-stranded 
^ 4n the two mRNA sources. Such singj. 
expressed in the two ^^^^g material for the 

reaction (PCR; the experimental e»D ^^^^^^ 

„UU1., K.B., 19S7. U.S. patent «o. 
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allow th primers to amplify only a subset of the mRNA 
derived sequences present in the sample of int rest. This 
is preferred in that it allows more accurate and complete 
visualization and characterization of each of the bands 
representing amplified sequences. 

The 5' primer can contain a nucleotide sequence 
expected, statistically, to have the ability to hybridize to 
cDNA sequences derived from the tissues of interest. The 
nucleotide sequence can be an arbitrary one, and the length 
of the 5' oligonucleotide primer can range from about 9 to 
about 15 nucleotides, with about 13 nucleotides being 
preferred. 

Additionally, arbitrary primer sequences cause the 
lengths of the amplified partial cDNAs produced to be 
variable, thus allowing different clones to be separated by 
using standard denaturing sequencing gel electrophoresis. 

PCR reaction conditions should be chosen which 
optimize amplified product yield and specificity, and, 
additionally, produce amplified products of lengths which 
can be resolved utilizing standard gel electrophoresis 
techniques. Such reaction conditions are well known to those 
of skill in the art, and important reaction parameters 
include, for example, length and nucleotide sequence of 
oligonucleotide primers as discussed above, and annealing 
and elongation step temperatures and reaction times. 

The pattern of clones resulting from the reverse 
transcription and amplification of the mRNA of two different 
cell types is displayed via sequencing gel electrophoresis 
and compared. Differences in the two banding patterns 
indicate potentially differentially expressed genes. 

Once potentially differentially expressed gene 
sequences have been identified via bulk techniques such as, 
for example, those described above, the differential 



express! n of such putatively differentially expressed genes 
should be corroborated. Corroboration can be accomplished 
via, for xample, such well-known techniques as Northern 
analysis, quantitative RT-coupled PCR or RNase protection. 

Upon corroboration, the differentially expressed 
genes can be further characterized, and can be identified as 
target and/or fingerprint genes, as discussed, below, in 

Section 5.1.4. 

Also, amplified sequences of differentially 
expressed genes obtained through differential display can be 
used to isolate the full length clones of the corresponding 
gene. The full-length coding portion of the gene can 
readily be isolated, without undue experimentation, by 
molecular biological techniques well known in the art. For 
example, the isolated differentially expressed amplified 
fragment can be labeled and used to screen a cDNA library. 
Alternatively, the labeled fragment can be used to screen a 
genomic library. 

PCR technology can also be utilized to isolate full- 
length cDNA sequences. As described in this section above, 
the isolated amplified gene fragments (of about at least 10 
nucleotides, preferrably longer, of about 15 nucleotides) 
obtained through differential display have their 5' terminal 
end at some random point within the gene and have 3' 
terminal ends at a position corresponding to the 3' end of 
the transcribed portion of the gene. Once nucleotide 
sequence information from an amplified fragment is 
obtained, the remainder of the gene (1^6^, the 5' end of the 
gene, when -utilizing differential display) can be obtained 
using, for example, RT PCR. 

In one embodiment of such a procedure for the 
identification and cloning of full length gene sequences. 



RNA can be isolated, following standard procedures, from an 
appropriate tissu or cellular source. 

A reverse transcription reaction can then be 
performed on the RNA using an oligonucleotide primer 
5 complementary to the mRNA that corresponds to the amplified 
cloned fragment, for the priming of first strand synthesis • 
Because the primer is anti-parallel to the mRNA, extension 
will proceed toward the 5' end of the mRNA. The resulting 
RNA/DNA hybrid can then be "tailed" with guanines using a 

10 standard terminal transferase reaction, the hybrid can be 

digested with RNAase H, and second strand synthesis can then 
be primed with a poly-C primer. Using the two primers, the 
5' portion of the gene is then amplified using PGR. 
Sequences obtained can then be isolated and recombined with 

15 previously isolated sequences to generate a full-length cDNA 
of the differentially expressed genes of the invention. For 
a review of cloning strategies and recombinant DNA 
techniques which can be used, see, e.g. , Sambrook et al. . 
1989, Molecular Cloning, A Laboratory Manual, Cold Springs 

20 Harbor Press, N.Y.; and Ausubel et al. . 1989, Current 

Protocols in Molecular Biology, (Green Publishing Associates 
and Wiley Interscience, N.Y.). 

5.2 • METHODS FOR THE IDENTIFICATION 
OF PATHWAY GENES 

25 Methods are described herein for the identification 

of pathway genes. "Pathway gene," as used herein, refers to 
a gene whose gene product exhibits the ability to interact 
with gene products involved in tumor progression. A pathway 
gene can be differentially expressed and, therefore, can 

30 have the characteristics of a target and/or fingerprint 
gene. 
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Any method suitabl for detecting protein-prot in 
interactions can b employed for identifying pathway gen 
products by identifying interactions between gene products 
and gene products known to be involved in tumor progression 
and tumor progression disorders, including metastatic 
disorders. Such known gene products can be cellular or 
extracellular proteins. Those gene products which interact 
with such icnown gene products represent pathway gene 
products and the genes which encode them represent pathway 

Among the traditional methods which can be employed 
are co-immunoprecipitation, cross-linking and co- 
purification through gradients or chromatographic columns. 
Utilizing procedures such as these allows for the 
identification of pathway gene products. Once identified, 
pathway gene product can be used, in conjunction with 
standard techniques, to identify its corresponding pathway 
gene. For example, at least a portion of the amino acid 
sequence of the pathway gene product can be ascertained 
using techniques well known to those of skill in the art, 
such as via the Edman degradation technique (see, s^. 
creighton, 1983, Proteins: Structures and Molecular 
principles, W.H. Freeman & Co., N.Y., pp.34-49). The amino 
acid sequence obtained can be used as a guide for the 
generation of oligonucleotide mixtures that can be used to 
screen for pathway gene sequences. Screening can be 
accomplished, for example by standard hybridization or PCR 
techniques. Techniques for the generation of 
Oligonucleotide mixtures and the screening are 
(Se!, Ausubel, and PCR Protocols: A Guide to 

Methods and Applications, 1990, Innis, M. sL^. e^^' 
Academic Press, Inc., New York). 



t 

Additionally, methods can be employed which result 
in the simultaneous identification of pathway genes which 
encode th protein interacting with a protein involved in 
tumor progression and tumor progression disorders, including 
5 metastatic diseases. These methods include, for example, 
probing expression libraries with labeled protein known or 
suggested to be involved in metastatic diseases using this 
protein in a manner similar to the well known technique of 
antibody probing of Agtll libraries. 

10 One method which detects protein interactions in 

vivo , the yeast two-hybrid system, is described in detail 
for illustration only and not by way of limitation. One 
version of this system has been described (Chien et al* , 
1991, Proc. Natl. Acad. Sci, USA, M:9578-9582) and is 

15 commercially available from Clontech (Palo Alto, CA) . 

Briefly, utilizing such a system, plasmids are 
constructed that encode two hybrid proteins: the first 
hybrid protein consists of the DNA-binding domain of a 
transcription factor f e.o. . activation protein) fused to a 

20 known protein, in this case, a protein known to be involved 
in tumor progression, and the second hybrid protein consists 
of the transcription factor's activation domain fused to an 
unknown protein that is encoded by a cDHA which has been 
recombined into this plasmid as part of a cDNA library. The 

25 plasmids. are transformed into a strain of the yeast 

Saccharomyces cerevisiae that contains a reporter gene 
( e.g. . lasii) whose expression is regulated by the 
transcription factor's binding site. Either hybrid protein 
alone cannot activate transcription of the reporter gene. 

30 The DNA binding hybrid protein cannot activate transcription 
because it does not provide the activation domain function 
and the activation domain hybrid protein cannot activate 
transcription because it lacks the domain required for 



binding t it. "f'* f " Mndln, .ite) . 
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0,0 gene products, ^^^^J'^J^^^ „ metastatic 
and tu»or progression ^ Total 

diseases, can be used as the encoding an 
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30 product can be .ade usxng «e«o*s ^^^^^^^ ^^.m, 
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the activation domain of GAL4. This library can be co- 
transformed along with the bait gene-GAL4 fusion plasmid 
into a yeast strain which contains a lacZ gene whose 
expression is controlled by a promoter which contains a GAL4 

5 activation sequence. A cDNA encoded protein, fused to GAL4 
activation domain, that interacts with the bait gene product 
will reconstitute an active GAL4 transcription factor and 
thereby drive expression of the lacZ gene. Colonies which 
express lacZ can be detected by their blue color in the 

10 presence of X-gal. cDNA containing plasmids from such a 
blue colony can then be purified and used to produce and 
isolate the bait gene product interacting protein using 
techniques routinely practiced in the art. 

Once a pathway gene has been identified and 

15 isolated, it can be further characterized as, for example, 
discussed below, in Section 5.3. 

5.3. CHARACTERIZATION OF DIFPERBMTIALLY 

ByPRESSED AND PATH WAY GENES 

Differentially expressed genes, such as those 
20 identified via the methods discussed, above, in Section 5.1, 
and pathway genes, such as those identified via the methods 
discussed, above, in Section 5.2, above, as well as genes 
identified by alternative means, can be further 
characterized by utilizing, for example, methods such as 
25 those discussed herein. Such genes will be referred to 
herein as "identified genes." 

Analyses such as those described herein, yield 
information regarding the biological function of the 
identified genes. An assessment of the biological function 
30 of the differentially expressed genes, in addition, will 
allow for their designation as target and/or fingerprint 
genes . 
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sp clflcally, any of th. dltter.ntlaUy .xpre««l 
„„„ who.. furth.r Characterization indicate, that a 
Lulation ot th. ,en,'s expression or a ■»»*>'^^*"";' 
,ene product's activity can Inhibit tu^or progression will 
rdesignated "target genes," as defined, abov., in Section 
S i such target genes and target gene products, along with 
those discussed below, will constitute the 
compound discovery strategies discussed, below, in Section 
5 8 Further, such target genes, target gene product. 

;l;or BodulaUng compounds can be used as P^rt of the tu.or 
Trogression disorder treatment .ethod. described, below, in 

^""^""^y of the differentially expressed genes whose 
further characterization indicates that such modulations 
Toernot positively affect tumor progression, but whose 
expression pattern contributes to a gene expression 
Xerprint- pattern correlative of, for example, umor 
progression will be designated a "fingerprint gene " 
"Fingerprint patterns" will be more fully discussed, below, 
in section 5.11.1. " should be noted that each of the 
CefUs also function as fingerprint genes, a. can 
all or a portion of the pathway genes. 

It Should further be noted that the pathway genes 
can also be characterized according to 
those described herein. Those pathway genes which yield 
information indicating that they are differentially 
e^ressed and that modulation of the gene's expression or a 
:riation Of th. gene product's activity - 
progre..ion or ameliorate tumor progression-associated 
symptom, will also be designated "target gen..." Su^ 
t«get genes and target gene products, along with tho.e 
disc^^sed above, will constitute the — of the compound 
discovery strategies discussed, below, in Section 5.8 and 
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„„ b used a, part of the treatment ..thods described In 

""'°"xtlo"7be addUicnaXly noted that the 

.Ton of one or more of the pathway ,enes can 
^TafrraoK : L erential egression, hut evidence that 
reveal a lacK of expression can, 

modulation of 9ene - progression. In 

nonetheless ^.e iorate symp ^^^^^^ ^^^^^ ^^^^ ^ 

such cases, these 9en ^^^^ strategies of 

considered a focus of tne f treatment 
section 5.8, below and can be used as part 

A v^j-xc^j piT-Qt the nucleotide 

iH^antified genes. First:, ta« 
Characterize the xdent f g ^^^^^ ^ ^^^^^^^ 

sec^ence ^^^^^^^^^^^ J,, „ell known to those of skill 
utilizing standard ""^"'^ characterize such genes. 

the art, =-J:;;::;;rtriaentif led gene, can reveal 
; For example, the sequenc ^ ^^^^^ 

ridtrorTtrn regrdC:: bllglcal faction Of the 

"-""^oX rr;;sis of ,r 
0 ; ::rutri:irrd«: weu Known 

can b. conducted, ""^^^^^^ techniques can include, 

to those of sKill in the art. ^nd BKase 

for example, Northern analyses, RT coupl 
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protection techniques. Such analyses provide information as 
to whether the identified genes are expressed in tissues 
expected to contribute to tumor progression. Such analyses 
can also provide quantitative information regarding steady 
5 state mRNA regulation, yielding data concerning which of the 
identified genes exhibits a high level of regulation in, 
preferably, tissues which can be expected to contribute to 
tumor progression. Additionally, standard ia situ 
hybridization techniques can be utilized to provide 

10 information regarding which cells within a given tissue 

express the identified gene. Such an analysis can provide 
information regarding the biological function of an 
identified gene relative to given tumor progression in 
instances wherein only a subset of the cells within the 

15 tissue is thought to be relevant to the disorder. 

Third, the sequences of the identified genes can be 
used, utilizing standard techniques, to place the genes onto 
genetic maps, e.g. . mouse (Copeland, N.G. and Jenkins, N.A., 
1991, Trends in Genetics 7:113-118) and human genetic maps 

20 (Cohen, D., et al. , 1993, Nature 366 : 698-701) . Such mapping 
information can yield information regarding the genes' 
importance to human disease by, for example, identifying 
genes which map within genetic regions to which known 
genetic tximor progression disorders map. 

25 Fourth, the biological function of the identified 

genes can be more directly assessed by utilizing relevant in 
vivo and in vitro systems. In vivo systems can include, but 
are not limited to, animal systems which naturally exhibit 
symptoms of tumor progression, such as metastatic disease, 

30 or ones which have been engineered to exhibit such symptoms. 
For example, tvimor progression animal models may be 
generated by injecting animals, such as mice, with tumor 
cells, some of which will give rise to tumors within the 



injected animals. Axaong the cells which may be utilized for 
such a purpos are cells listed, above, in Section 5.1.1.1, 
such as the B16 cell variants. 

The role of identified gene products (e^, 030 gene 
products) can be determined by transfecting cDNAs encoding 
these gene products into appropriate cell lines, such as, 
for example, a B16 cell line variant, and analyzing the 
effect on tumor progression characteristics. For example 
the role/function of genes important in the progression of 
human colorectal cancers are assessed using the KM12C ( ow 
metastatic potential) and KM12L4 (highly metastatic) cells 
implanted into nude mice spleens and the number of hepatic 
tumors that develop are determined. The function of genes 
isolated using human colorectal tumors and their hepatic 
metastases are assessed by expressing the gene in the 
appropriate KM12 variant. Additionally, the role/function 
of genes important in the progression of prostatic and 
breast cancers are assessed using appropriate cell lines 
described above in Section 5.1.1.1. Importantly, the 
role/function of genes important in the progression of 
melanoma, colon, prostate and breast cancers in humans are 
assessed using biopsy specimens from patients having 
undergone surgical treatment, as described in Section 

5.1.1.1. above. 

Further, such systems can include, but are not 
limited to transgenic animal systems such as those 
described, above, in Section 5.7.1 below. In 3^ systems 
can include, but are not limited to, cell-based systems 
comprising cell types known or suspected of contributing to 
tumor progression. Such cells can be wild type cells, or 
can be non-wild type cells containing modifications Icnown to 
or suspected of, contributing to tumor progression. Such 
systems are discussed in detail, below, in Section 5.7.2. 
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. to identify and isolate the human homologue of 
The proc dure to identity section 5.7.3. 

the fo.y030 g ne is described below „ 
T« further characterizing the bioiogi^ 
in further ^^^ession of these genes can be 

the identified , ^ systems, i^, 

• i-u4« -»->>^ in VIVO ana/o3- iH . ^-^^^^ 
nodulated within the in yi_ subsequent effect 

eithe. ove.- =-~r"-^^^^^^^^ - °' 

on the system then „„^„l,ted by 

the product of the identified g activity in the 

either increasing or ""tntereit! and its 

in ii22 and/or in !£iSEa system of interest, 

subsequent effect then assayed. 

The IntorBatlon obtained through such 
Characterizations can suggest 

treatment of tumor P"^«-":;",2::::;"^:::ther. relevant 
disorders involving '^/^^ ^ /tumor cells involving 
methods for controlling the spre ,„,o„ation 

the gene of i---^:;-^^^^^^^^ pimple, treatment 
obtained from such character! 

can i-i"-r\!=<'"^;::::,: ratirprocedures such as those 
product activity, ^haracte^^ modulation should 

described herein can indicate wher „ion or 

involve an increase or a ^ ^f in«rest. Such 

activity of the gene or gene P"^-* f J\„ 5.9. 
methods of treatment are discussed, below, 

5 nim rfiTBTTiv oBMBS . 

.fferentially -n:/;rar:e;^^^^ - 

T^:::^:^^^-^- -ve, are described 
0 herein. oppressed and pathway genes of 

i?i„veriorr:^ -X i- ^- - — 



seqbenc for the differentially expressed fomy030 gene is 
shownSlKFIGS. 2 and 3A and 3B. Specifically, FIG. 2 
depicts ti^cleotide sequence (SEQ ID N0:1) of the 
amplified cD^S^nd initially identified via differential 
display analysis?>ich is referred to herein as roBy030 
FIGS. 3A AND 3B depS^^ nucleotide sequence (SEQ ID N0:2) 
of a fomy030 cDNA clone^h was isolated using a romy030 
probe. The deduced amino a^Usequence also is shown in 
FIGS. 3A and 3B (SEQ ID NO : 3 ) . 5 shows the nucleotide 

(SEQ ID N0:6) and deduced amino aci^^quences (SEQ ID N0:7) 
of a fohy030 CDNA clone which was isol^using the entire 
.ouse fomy030 cDNA as a probe. FIG. 6 sh^^ alternative 
splice form of fohy030 (SEQ ID N0s:8 and 9) . \ 

Table 1 summarizes information regarding the further 
characterization of the differentially expressed fomy030 
gene of the invention. Table 2 lists £_coU clones, 
deposited with the Agricultural Research Service Culture 
collection (NRRL) , which contain sequences found within the 

genes of Table 1. ^ j. ^ 

in Table 1, the paradigm used initially to detect 
the differentially expressed gene is described under the 
column headed "Paradigm of Original Detection." In this 
column, "t" indicates that gene expression is higher (i^, 
there is a greater steady state amount of detectable mRNA 
produced. by a given gene) in the indicated cell type 
relative to the other cell type, while "I" indicates that 
gene expression is lower (i^, there is a lower steady 
state amount of detectable mRNA, produced by a given gene) 
in the indicated cell type relative to the other -U type. 
AS indicated under this column, the 030 gene was initially 
identified via a differential screen between B16 Fl (low 
metastatic potential cells) and B16 FIO (high -tastatic 
potential cells) in which 030 gene expression is lower in 
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the high metastatic potential B16 FIO cell line than in the 
low metastatic potential B16 Fl cell line. 

The Table 1 column headed "Paradigm Expression 
Pattern" lists the cell type in which gene expression was 
5 initially detected. In the case of the 030 gene, gene 

expression was first detected in melanoma (1^6^, B16) cells. 
"Detectable" as used herein, refers to levels of mRNA which 
are detectable, via standard differential display. Northern, 
RT-coupled PGR and/or RNase protection techniques which are 

10 well known to those of skill in the art. 

Cell types in which differential expression was 
detected are summarized in Table 1 under the column headed 
"Cell Type Detected in." In the case of the 030 gene, 
expression has additionally been detected within melanocyte 

15 cells. 

Additionally, in instances wherein the genes contain 
nucleotide sequences similar or homologous to sequences 
found in nucleic acid databases, references to such 
similarities are listed. Because the 030 gene is a novel 
20 gene, i^, no homologous gene sequences are present in the 
published databases, no such reference is listed. 
^~—y Finally^ucleotide sequences contained within the 
di'ffirentially^SSjfess^d^nes are listed in the Figures 
indicated under the headi^r~*^S^a:" In the case of the 
fomy030 gene, such sequences arellsted^ FIGS. 2 and 3 A 
and 3B, and for fohy030, in FIGS. 5 and 6. 

genes listed in Table 1 can be obtained using 
known to those skilled in the art, 
including, but not liidtld~^ui^..the use of appropriate probes 
to detect the genes within an apprSpri^tecDNA or gDNA 
(genomic DNA) library. (See, for exampl^T^Saiabrook e£_ai^, 
1989, Molecular Cloning: A Laboratory Manual, Cold-Spring 
Harbor Laboratories, which is incorporated by reference 



25 



'^cloning metho 



30 
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15 



hereliKin its entirety) . Probes for the novel sequences 
report dfiireiacan be obtained directly from the isolated 

clones deposit^T^i^ir^^-^S^ ^^^^ ^ 

below. Alternatively, oligon;:^liW.i4e^bes for the novel 
genes can be synthesized, using techniqili^-^A^ell known to 
those of skill in the art, based on the DNA segtlences 
disclosed herein in FIGS. 2, 3A, 3B, 5, and 6. 

The probes can be used to screen cDNA libraries 
prepared from an appropriate cell or cell line in which the 
gene is transcribed. For example, the genes described 
herein that were detected in melanocyte cells can be cloned 
from a cDNA library prepared from melanocyte cells such as, 
for example, melan-c (Hodgkinson, C.A., et al., 1993, Cell 
74-395-404), the cDNA libraries developed from the human 
"ianoma cell line A2058 (Clontech, Palo Alto, CA) and cDNA 
libraries developed from the murine melanoma cell Ime K1735 

T» T«na CA) Genomic DNA libraries can be 
(Stratagene, La Jolia, ca; . vjenwuij. 

prepared from any source. 




Table 2, below, lists an Sr coli strain as deposited 
with the NRRL, which contains an isolated plasmid fomy030 
clone. The clone contains a fomy030 cDNA in a pBlueScript 
SK- (Stratagene, La Jolla, CA) vector which was isolated 
from a mouse melanocyte cDNA library screened with a romy030 
probe, as described in Section 6.2, below. 



10 



TABLE 2 



GENE 


STRAIN DEPOSITED 
WTTH NRRL 


PLASMID CLONE CONTAINED | 
WITHIN DEPOSITED STRAIN 


fomy030 
fohy030 


FOMY030 


pFOMY030 1 



15 




25 



AX^used herein, "differentially expressed gene" 
( i.e. . tar^and fingerprint genes) or "pathway gene" 
refers to (a)Xgene containing: at least one of the DNA 
sequences disclo^Hherein (as shown in FIGS. 2, 3A, 3B, 5, 
and 6) or contained i« the clones listed in Table 2, as 
deposited with the NRRlX^) any DNA sequence that encodes 
the amino acid sequence en^b^ed by: the DNA sequences 
disclosed herein (as shown in>^S. 2, 3A, 3B, 5, and 6), 
contained in the clones, listed Vrable 2, as deposited 
with the NRRL or contained within th^coding region of the 
gene to which the DNA sequences disclo^^herein (as shown 
in FIGS. 2, 3A, 33, 5, and 6) or containeV^Ti the clones 
listed in Table 2, as deposited with the NRR^K^long; (c) 
any DNA sequence that hybridizes to the complement of : the 
coding sequences disclosed herein (as shown in FIGSn.^ 3A, 
33, 5, and 6), contained in clones listed in Table 2,>s 
deposited with the NRRL, or contained within the coding 
region of the gene to which the DNA sequences disclosed 



15 




5 and 6) or contained 

1* QtXat 68°C (Ausubel F.M. e£,.^i_' 
O.lxSSC/0.1% SD^sat „.„ioav Vol. I, Green 

cu„ent -t.coi>V;--- r,r;i.ey . sons. Xnc, 
pu.lisnln, --=--V::,' ,„„aes a gene product 
Hew Yort, at p. ^duot encoded by a gene 

functionally -^"^^^"^ V ^.^enoe that hybridizes to 
°' (a,. Xse Jnces disclosed herein, 

the complement ot. the contained in the 

(as shown in FISS. 2, 3A, ' V „ith the NBBL or 
clones listed in Table 2, P =^ ^^.^^ ^NA 

contained within - ^ "X „ .OS. 2. 3A. 3B. 

sequences disclosed herein (a ^^^^^ ^_ 

and 6) or contained in the = "^^^^^^^^ stringent 
deposited with the ^:r;„:e„t\ditions, ^. 

conditions, such - -^^^^'/f^ ,,ce (xusube^N^. 
washing in o.2xSSC/o.i% SDS at 4 cducV^tionally 

preferably DNX molecules, that hy , through 

therefore the complements "i.^^-^^ ^.rridlzation 
(d, , in the preceding paragraph. Such hy 3„i„gent, 
londitions can be ^^^^J 

as described above. In ,»oUgos-), highly 

molecules are deo^cyoligonucleot.des ( g^^^,_^^ 

stringent conditions can refer £^ 

.xSSC/0.05* sodium Py"P''-P;^" ' oc for 20-base 
oligos), 48°c (torl7-base ougos), 55 
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20 



25 



30 



oligos), and 60<>C (for 23-base oligos) . These nuo ei= acid 
molecules can act a= target gene antlsene. molecules, 
ueifuX, tor example, in t«,et gene regulation and/or as 
antisense primers in amplification reactions of target, 
fingerprint, and/or pathway gene nucleic acid sequences. 
Further, such sequences can be used as part of r.bozyme 
"d or «iple helix sequences, also useful for target gene 
regulation. Still further, such molecules can be used as 
components of diagnostic methods whereby tumor progression 

disorders can be detected. 

The invention also encompasses (a, DNA vectors that 
contain any of the foregoing coding sequences 
complements (i^, antisense, , (b, DNX expression vectors 
that contain any of the foregoing coding =-<>"^"=" 
operatively associated with a regulatory element that 
directs the expression of the coding seciuences; and (c, 
genetically engineered host cells that 
foregoing coding sequences operatively associated with a 
foregoing , directs the expression of the coding 

regulatory element that directs tne e t. 
seLnces in the host cell. As used herein, regulatory 
T^Zs include but are not limited to inducible and non- 

" b e ::omoters. enhancers, operators and other elements 
Known to those sKiUed in the art that drive and re^ ate 
expression. The invention includes fragments of any of 
DNA sequences disclosed herein. 

in addition to the gene sequences described above, 
homologues of these gene sequences as can, 
present in other species, preferably human in 
Lerein the above described gene sequences are not human 
gene sequences, can be identified and can readily b. 
isolated, without undue experimentation, by 
biological techniques well known In the art. 
can exist genes at other genetic loci within the genome that 
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10 



15 



M 20 



25 



enooa p«telns which have extensive ho»oio« to one o^ nore 
aoLins of such ,ene proaucts. These ,enes can also be 
iH^ni-ified via similar techniques. 

nter«t. Hybridization conditions will be o£ a lower 

organism different ir .i,„rnativelY, the labeled 

labeled sequence was derived. 

► -^r, h. used to screen a genomic library derive 
fragment can be used t appropriately 

from the organism c -^-^ „„,,,,ons will 

stringent =-^^"°-; ^J^'^^'^^^ ,„ ..t, and will vary 

be well known to those organisms from which 

predictably depending on the specific g 

the library and the labeled sequences are l"^"^ 

Tid nee regarding such conditions see, for e»mp e 

TambrooK^. 1-,. ^.tsreH^. 

Manual Cold Springs Harbor Press, N.Y., ana au 

! : ;urrent Protocols in Molecular Biology. =reen 

publishing Associates and Wiley '-^'^^^'^^ntuil^ 
Further, a previously un3a,own -^««"""'"^ 

:r:r iLnrr::;.^ 

human or non-human cell lines or tis 
to express a differentially expressed or pathway ge 
^i!^ L PCR product can be subcloned and sequenced to 
) allele. The fck ^-nresent the sequences 

insure that the amplified /J//^^^^^^^^^ Tcleic 

of a differentially expressed or pathway ge 



acid sequence. 
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*.\,^ry w used to isolate a full 
The PGR fragment can then b usea t 

a variety of methods. For example, 

» he used to screen a genomic library, 
fragment can be usea t:o „tilized to Isolate full 

PCR technology can also be utilized 

.engtb cOK. seguences. Por -7-' -^//^^r 
following standard ---- ^^^^^^^^^ .L,,.„ «n be 

or tissue source J re ^ p.^.er 

""m: f . "st . L Of tne amplined fragment for 
specific tor synthesis. The resulting 

digested wi ^^^^ sequences 

be primed with a poiy <. f »,,iiv be isolated. 

*,.,m of the amplified fragment can easily oe 
upstream of the a p ^^^^^ ^an be used, see 

Por a review of =1°-"' cloning. . ^boratory 

sambrook sS_al^' "° Ausubel sSJl-. 

Manual, cold Springs Harbor Press, 
1,89, current Protocols in Molecular Biology, (Green 
Publ shlng Associates and Wiley Interscience, N.Y.) . 
publishing ^^^^ differentially expressed or 

identified is the normal, or wild type, gene, 
pathway '7;*;^"^ '^^^^o ^solate mutant alleles of the gene, 
r fn U -"n is preferable in P"-ses -/Orders 

Lich are Known or -::ra:erfrm\:dr;ir ^^^^^^^^^ Known 

Ts^er :o":raTerotype Which ---3 « — 

alleles and mutant aiiexe 
» r,».orrresaion symptoms. Mutant aiieies ai 
, progression symp ^he therapeutic and 

products can then be utiiizea i. 

diagnostic assay systems described below. 



A cDNA of a mutant gene can be isolated, for 
example, by using PGR, a technique which is well-known to 
one skilled in the art. In this case, the first cDNA strand 
can be synthesized by hybridizing a oligo-dT oligonucleotide 
to mRNA isolated from tissue known or suspected of being 
expressed in an individual putatively carrying the mutant 
allele, and by extending the new strand with reverse 
transcriptase. The second strand of the cDNA can then be 
synthesized using an oligonucleotide that hybridizes 
specifically to the 5'- end of the normal gene. Using these 
two primers, the product is then amplified via PGR, cloned 
into a suitable vector, and subjected to DNA sequence 
analysis through methods well-known to one skilled in the 
art. By comparing the DNA sequence of the mutant gene to 
that of the normal gene, the mutation(s) responsible for the 
loss or alteration of function of the mutant gene product 
can be ascertained. 

Alternatively, a genomic or cDNA library can be 
constructed and screened using DNA or RNA, respectively, 
from a tissue known to or suspected of expressing the gene 
of interest in an individual suspected of or known to carry 
the mutant allele. The normal gene or any suitable fragment 
thereof can then be labeled and used as a probe to identify 
the corresponding mutant allele in the library. The clone 
containing this gene can then be purified through methods 
routinely practiced in the art, and subjected to sequence 
analysis as described, above, in this Section. 

Additionally, an expression library can be 
constructed utilizing DNA isolated from or cDNA synthesized 
from a tissue known to or suspected of expressing the gene 
of interest in an individual suspected of or known to carry 
the mutant allele. In this manner, gene products made by 
the putatively mutant tissue can be expressed and screened 
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using standard antibody screening techniques in conjunction 
with antibodies raised against the normal gene product, as 
described, below, in Section 5.2.3. (For screening 
techniques, see, for example, Harlow, E. and Lane, eds., 
1988, "Antibodies: A Laboratory Manual," Cold Spring Harbor 
Press, Cold Spring Harbor.) In cases where the mutation 
results in an expressed gene product with altered function 
( e.g. , as a result of a missense mutation) , a polyclonal set 
of antibodies are likely to cross-react with the mutant gene 
product. Library clones detected via their reaction with 
such labeled antibodies can be purified and subjected to 
sequence analysis as described in this Section, above. 

Taking the fomy03 0 gene as an example, the fomyOSO 
human homolog can be isolated by a variety of methods. 
First, sequences found in a murine fomyOBO cDNA can be 
utilized as hybridization probes to detect hximan fohy030 
sequences. This can be accomplished, for example, by 
probing Southern blots containing total human genomic DNA 
with a labelled fomy030 probe. Once it is verified that the 
probe being utilized detects the human 030 gene, one of 
skill in the art can employ any of several routine 
approaches to isolate the human gene without undue 
exper imentat ion • 

In one approach, appropriate human cDNA libraries 
can be screened. Such cDNA libraries can, for example, 
include human melanocyte, human retina and fetal human brain 
cDNA libraries. For example, panels of human melanoma cells 
(such as, for example, SK-MEL-2, ATCC 68-HTB; SK-MEL-5, ATCC 
70-HTB; SK^MEL-28, ATCC 72-HTB; G-361, ATCC 1424-CRL; and/or 
HT-144 [63-HTB] cells) can be screened for 030 expression 
by, for example, Northern blot analysis. Upon detection of 
030 transcript, cDNA libraries can be constructed from RNA 
isolated from the appropriate cell line, utilizing standard 



techniques well known to those of skill in the art. The 
huinan cDNA library can then be screened with a 030 probe in 
order to isolate a human romy030 cDNA. As described below, 
this method was used to determine the human fohy030 cDNAs in 
5 FIGS. 5 and 6. 

Alternatively, a human total genomic DNA library can 
be screened using 030 probes. 030-positive clones can then 
be sequenced and, further, the intron/exon structure of the 
human 030 gene may be elucidated. Once genomic sequence is 
10 obtained, oligonucleotide primers can be designed based on 
the sequence for use in the isolation, via, for example RT- 
coupled PGR, of human 030 cDNA. 

The procedures described in these approaches are 
£0 routine and have been described in detail in Sections 

S 15 5.1.1.2, 5.3 and 5.7.2. 

y = 

5.5. DIFFERENTIALLY EXPRESSED 
<f AMD PATHWAY GENE PRODUCTS 

'j^ Differentially expressed and pathway gene products 

O include those proteins encoded by the differentially 

20 expressed and pathway gene sequences described in Section 
y 5.2.1, above, as for example, the peptide listed in FIG. 3. 

Specifically, differentially expressed and pathway gene 
' products can include differentially expressed and pathway 

gene polypeptides encoded by the differentially expressed 
25 and pathway gene sequences contained in the clones listed in 
Table 2, above, as deposited with the NRRL, or contained in 
the coding regions of the genes to which DNA sequences 
disclosed herein (in FIGS. 3A, 3B, 5, and 6) or contained in 
the clones, listed in Table 2, as deposited with the NRRL, 
30 belong, for example. 

In addition, differentially expressed and pathway 
gene products can include proteins that represent 
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functionally equivalent gene products. Such an equivalent 
differentially expressed or pathway gene product can contain 
deletions, additions or substitutions of amino acid residues 
within the amino acid sequence encoded by the differentially 
expressed or pathway gene sequences described, above, in 
section 5.2.1, but which result in a silent change thus 
producing a functionally equivalent differentially expressed 
on pathway gene product. Amino acid substitutions can be 
made on the basis of similarity in polarity, charge, 
solubility, hydrophobicity, hydrophilicity, and/or the 
amphipatic nature of the residues involved. For example, 
nonpolar (hydrophobic) amino acids include alanine, leucine, 
isoleucine, valine, proline, phenylalanine, tryptophan, and 
methionine; polar neutral amino acids include glycine, 
serine, threonine, cysteine, tyrosine, asparagine, and 
glutamine; positively charged (basic) amino acids include 
arginine, lysine, and histidine; and negatively charged 
(acidic) amino acids include aspartic acid and glutamic 
acid. "Functionally equivalent," as utilized herein, refers 
to either a protein capable exhibiting a substantially 
similar in vivo activity as the endogenous differentially 
expressed or pathway gene products encoded by the 
differentially expressed or pathway gene sequences described 
in section 5.2.1, above. Alternatively, when utilized as 
part of assays such as those described, below, in Section 
5.3, "functionally equivalent" can refer to peptides capable 
of interacting with other cellular or extracellular 
molecules in a manner substantially similar to the way in 
which the corresponding portion of the endogenous 
differentially expressed or pathway gene product would. 

The differentially expressed or pathway gene 
products can be produced by synthetic techniques or via 
recombinant DNA technology using techniques well known in 



the art. Methods for preparing the differentially expressed 
or pathway gene polypeptides and peptides of the invention 
by xpressing nucleic acid encoding differentially expressed 
or pathway gene sequences are described herein. Methods 
which are well known to those skilled in the art can be used 
to construct expression vectors containing differentially 
expressed or pathway gene protein coding sequences and 
appropriate transcriptional/ translational control signals. 
These methods include, for example, in vitro recombinant DNA 
techniques, synthetic techniques and in vivo 
recombination/genetic recombination. See, for example, the 
techniques described in Maniatis et^, 1989, Molecular 
Cloning A Laboratory Manual, Cold Spring Harbor Laboratory, 
N Y which is incorporated by reference herein in their 
entirety, and Ausubel, 1989, suees. Alternatively, RNA 
capable of encoding differentially expressed or pathway gene 
protein sequences can be chemically synthesized using, for 
example, synthesizers. See, for example, the techniques 
described in "Oligonucleotide Synthesis," 1984, Gait, M.J. 
ed., IRL press, oxford, which is incorporated by reference 

herein in its entirety. 

A variety of host-expression vector systems can be 
utilized to express the differentially expressed or pathway 
gene coding sequences of the invention. Such host- 
expression systems represent vehicles by which the coding 
sequences of interest can be produced and subsequently 
purified, but also represent cells which can, when 
transformed or transfected with the appropriate nucleotide 
coding sequences, exhibit the differentially expressed or 
30 pathway gene protein of the invention in situ. These 
include but are not limited to microorganisms such as 
bacteria (e^, 1. ssli, subtiUs) transformed with 
recombinant bacteriophage DNA, plasmid DNA or cosmid DNA 
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expression vectors containing differentially expressed or 
pathway gene prot in coding sequences; yeast (a^a^, 
g;,r.rharomvces . Pichia ) transformed with recombinant yeast 
expression vectors containing the differentially expressed 
or pathway gene protein coding sequences; insect cell 
systems infected with recombinant virus expression vectors 
( e.g. . baculovirus) containing the differentially expressed 
or pathway gene protein coding sequences; plant cell systems 
infected with recombinant virus expression vectors (e^, 
cauliflower mosaic virus, CaMV; tobacco mosaic virus, TMV) 
or transformed with recombinant plasmid expression vectors 
( e.g. . Ti plasmid) containing differentially expressed or 
pathway gene protein coding sequences; or mammalian cell 
systems (g^, COS, CHO, BHK, 293, 3T3) harboring 
recombinant expression constructs containing promoters 
derived from the genome of mammalian cells (e^^, 
metallothionein promoter) or from mammalian viruses (e^, 
the adenovirus late promoter; the vaccinia virus 7.5K 
promoter) . 

in bacterial systems, a number of expression vectors 
can be advantageously selected depending upon the use 
intended for the differentially expressed or pathway gene 
protein being expressed. For example, when a large quantity 
of such a protein is to be produced, for the generation of 
antibodies or to screen peptide libraries, for example, 
vectors which direct the expression of high levels of fusion 
protein products that are readily purified can be desirable, 
such vectors include, but are not limited, to the coli 
expression vector pUR278 (Ruther et al. , 1983, EMBO J. 
2:1791), in which the differentially expressed or pathway 
gene protein coding sequence can be ligated individually 
into the vector in frame with the lacZ coding region so that 
a fusion protein is produced; pIN vectors (Inouye & Inouye, 
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1985, Nucleic Acids Res. 13:3101-3109; Van Heeke & Schuster, 
1989, J. Biol. Chem. 264 :5503-5509) ; and the like. pGEX 
vectors can also be used to express foreign polypeptides as 
fusion proteins with glutathione S-transf erase (GST) . In 
5 general, such fusion proteins are soluble and can easily be 
purified from lysed cells by adsorption to glutathione- 
agarose beads followed by elution in the presence of free 
glutathione. The pGEX vectors are designed to include 
thrombin or factor Xa protease cleavage sites so that the 
10 cloned target gene protein can be released from the GST 
moiety. 

In an insect system, Autographa californica nuclear 
polyhedrosis virus (AcNPV) is used as a vector to express 
foreign genes. The virus grows in Spodoptera frugiperda 

15 cells. The differentially expressed or pathway gene coding 
secpience can be cloned individually into non-essential 
regions (for example the polyhedrin gene) of the virus and 
placed under control of an AcNPV promoter (for example, the 
polyhedrin promoter) . Successful insertion of 

20 differentially expressed or pathway gene coding sequence 
will result in inactivation of the polyhedrin gene and 
production of non-occluded recombinant virus ( i.e. , virus 
lacking the proteinaceous coat coded for by the polyhedrin 
gene) . These recombinant viruses are then used to infect 

25 Spodoptera frugiperda cells in which the inserted gene is 
expressed f e.a. . see Smith et al. . 1983, J. Viol. 16:584; 
Smith, U.S. Patent No. 4,215,051). 

In mammalian host cells, a number of viral-based 
expression systems can be utilized. In cases where an 

30 adenovirus is used as an expression vector, the 

differentially expressed or pathway gene coding sequence of 
interest can be ligated to an adenovirus transcription/ 
translation control complex, e.g. , the late promoter and 



tripartite 1 ader sequence. This chimeric gene can then be 
inserted in the adenovirus genome by in vitEfi or in vivg 
recombination. Insertion in a non-essential region of the 
viral genome (e^, region El or E3) will result in a 
recombinant virus that is viable and capable of expressing 
differentially expressed or pathway gene protein in infected 
hosts (e^, see Logan & Shank, 1984, Proc. Natl. Acad. Sci. 
USA 81:3655-3659). Specific initiation signals can also be 
required for efficient translation of inserted 
differentially expressed or pathway gene coding sequences. 
These signals include the ATG initiation codon and adjacent 
sequences. In cases where an entire identified gene, 
including its own initiation codon and adjacent sequences, 
is inserted into the appropriate expression vector, no 
additional translational control signals can be needed. 
However, in cases where only a portion of the identified 
coding sequence is inserted, exogenous translational control 
signals, including, perhaps, the ATG initiation codon, must 
be provided. Furthermore, the initiation codon must be in 
phase with the reading frame of the desired coding sequence 
to ensure translation of the entire insert. These exogenous 
translational control signals and initiation codons can be 
of a variety of origins, both natural and synthetic. The 
efficiency of expression can be enhanced by the inclusion of 
appropriate transcription enhancer elements, transcription 
terminators, etc., (see Bittner et^, 1987, Methods in 
Enzymol. 1S3.: 516-544) . 

in addition, a host cell strain can be chosen which 
modulates the expression of the inserted sequences, or 
modifies and processes the gene product in the specific 
fashion desired. Such modifications glycosylation) 
and processing (e^, cleavage) of protein products can be 
important for the function of the protein. Different host 
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cells hav characteristic and specific mechanisms for the 
post-translational processing and modification of proteins. 
Appropriate cell lines or host systems can be chosen to 
ensure the correct modification and processing of the 
5 foreign protein expressed. To this end, eukaryotic host 
cells which possess the cellular machinery for proper 
processing of the primary transcript, glycosylation, and 
phosphorylation of the gene product can be used. Such 
mammalian host cells include but are not limited to CHO, 
10 VERO, BHK, HeLa, COS, MDCK, 293, 3T3 , WI38, etc. 

For long-term, high-yield production of recombinant 
proteins, stable expression is preferred. For example, cell 
lines which stably express the differentially expressed or 
pathway gene protein can be engineered. Rather than using 
15 expression vectors which contain viral origins of 
replication, host cells can be transformed with DNA 
controlled by appropriate expression control elements (e^, 
promoter, enhancer, sequences, transcription terminators, 
polyadenylation sites, etc.), and a selectable marker. 
20 Following the introduction of the foreign DNA, engineered 
cells can be allowed to grow for 1-2 days in an enriched 
media, and then are switched to a selective media. The 
selectable marker in the recombinant plasmid confers 
resistance to the selection and allows cells to stably 
25 integrate the plasmid into their chromosomes and grow to 

form foci which in turn can be cloned and expanded into cell 
lines. This method can advantageously be used to engineer 
cell lines which express the identified gene protein. Such 
engineered cell lines can be particularly useful in 
30 screening and evaluation of compounds that affect the 
endogenous activity of the differentially expressed or 
pathway gene protein. 
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A number of selection systems can be used, 
including, but not limited to, the herpes simplex virus 
thymidine kinase (Wigler, et a^. , 1977, Cell 11:223), 
hypoxanthine-guanine phosphor ibosyltransf erase (Szybalska & 
Szybalski, 1962, Proc. Natl. Acad. Sci. USA 41:2026), and 
adenine phosphoribosyltransf erase (Lowy, et al,, , 1980, Cell 
22:817) genes in tk", hgprt" or aprf cells, respectively. 
Also, antimetabolite resistance can be used as the basis of 
selection for dhfr, which confers resistance to methotrexate 
(Wigler, et al. . 1980, Natl. Acad. Sci. USA 27:3567; O'Hare, 
et al. . 1981, Proc. Natl. Acad. Sci. USA 78:1527); gpt, 
which confers resistance to mycophenolic acid (Mulligan & 
Berg, 1981, Proc. Natl. Acad. Sci. USA 78:2072); neo, which 
confers resistance to the aminoglycoside G-418 (Colberre- 
Garapin, et al. . 1981, J. Mol. Biol. and hygro, 

which confers resistance to hygromycin (Santerre, et al, , 
1984, Gene lfl:147) genes. 

An alternative fusion protein system allows for the 
ready purification of non-denatured fusion proteins 
expressed in human cell lines (Janknecht, et al. , 1991, 
Proc. Natl. Acad. Sci. USA 88:8972-8976). In this system, 
the gene of interest is subcloned into a vaccinia 
recombination plasmid such that the gene's open reading 
frame is translationally fused to an amino- terminal tag 
consisting of six histidine residues. Extracts from cells 
infected with recombinant vaccinia virus are loaded onto 
ni2+ nitriloacetic acid-agarose columns and histidine-tagged 
proteins are selectively eluted with imidazole-containing 
buffers. 

When used as a component in assay systems such as 
that described herein, the differentially expressed or 
pathway gene protein can be labeled, either directly or 
indirectly, to facilitate detection of a complex formed 
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between the differentially express d or pathway gene protein 
and a test substance. Any of a variety of suitable labeling 
systems can be used including but not limited to 
radioisotopes such as "Sj. enzyme labelling systems that 
generate a detectable colorimetric signal or light when 
exposed to substrate; and fluorescent labels. 

Where recombinant DNA technology is used to produce 
the differentially expressed or pathway gene protein for 
such assay systems, it can be advantageous to engineer 
fusion proteins that can facilitate labeling, solubility, 
immobilization and/or detection. 

indirect labeling involves the use of a third 
protein, such as a labeled antibody, which specifically 
binds to either a differentially expressed or pathway gene 
product, such antibodies include but are not limited to 
polyclonal, monoclonal, chimeric, single chain. Fab 
fragments and fragments produced by a Fab expression 
library. 

5.6. AMTIBODIES SPECIFIC FOR 
DIFFERENTIALLY EXPRESSED 
nP PATHWAY PRODOCT8 

Described herein are methods for the production of 
antibodies capable of specifically recognizing one or more 
differentially expressed or pathway gene epitopes. Such 
antibodies can include, but are not limited to polyclonal 
antibodies, monoclonal antibodies (mAbs) , humanized or 
chimeric antibodies, single chain antibodies. Fab fragments, 
F(ab')2 fragments, fragments produced by a FAb expression 
library, anti-idiotypic (anti-Id) antibodies, and epitope- 
binding fragments of any of the above. Such antibodies can 
be used, for example, in the detection of a fingerprint, 
target, or pathway gene in a biological sample, or. 
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alternatively, as a method for the inhibition of abnormal 
target gene activity. Thus, such antibodies can be utilized 
as tumor progression treatment methods, and/or can be used 
as part of diagnostic techniques whereby patients can be 
tested for abnormal levels of fingerprint, target, or 
pathway gene proteins, or for the presence of abnormal forms 
of the such proteins. 

For the production of antibodies to a differentially 
expressed or pathway gene, various host animals can be 
immunized by injection with a differentially expressed or 
pathway gene protein, or a portion thereof. Such host 
animals can include but are not limited to rabbits, mice, 
and rats, to name but a few. Various adjuvants can be used 
to increase the immunological response, depending on the 
host species, including but not limited to Freund's 
(complete and incomplete) , mineral gels such as aluminum 
hydroxide, surface active substances such as lysolecithin, 
pluronic polyols, polyanions, peptides, oil emulsions, 
keyhole limpet hemocyanin, dinitrophenol , and potentially 
useful human adjuvants such as BCG (bacille Calmette-Guerin) 
and CoryneJbacteriuJD parvum. 

Polyclonal antibodies are heterogeneous populations 
of antibody molecules derived from the sera of animals 
immunized with an antigen, such as target gene product 
( e.g. . protein encoded by 030) , or an antigenic functional 
derivative thereof. For the production of polyclonal 
antibodies, host animals such as those described above, can 
be immunized by injection with differentially expressed or 
pathway gene product (e^, 030) supplemented with adjuvants 
as also described above. 

Monoclonal antibodies, which are homogeneous 
populations of antibodies to a particular antigen, can be 
obtained by any technique which provides for the production 
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of antibody molecules by continuous cell lines in culture. 
Thes include, but are not limited to the hybridoma 
technique of Kohler and Milstein, (1975, Nature 495-497; 
and U.S. Patent No. 4,376,110), the human B-cell hybridoma 
technique (Kosbor et al. . 1983, Immunology Today 1:72; Cole 
at al. . 1983, Proc. Natl. Acad. Sci. USA 8fi: 2026-2030) , and 
the BV-hybridoma technique (Cole et al.. 1985, Monoclonal 
Antibodies And Cancer Therapy, Alan R. Liss, Inc., pp. 77- 
96) . Such antibodies can be of any immunoglobulin class 
including IgG, IgM, IgE, IgA, IgD and any subclass thereof. 
The hybridoma producing the mAb of this invention can be 
cultivated in vitro or in vivo. Production of high titers 
of mAbs in vivo makes this the presently preferred method of 
production. 

In addition, techniques developed for the production 
of "chimeric antibodies" (Morrison et ^X, , 1984, Proc. Natl. 
Acad. Sci., 81:6851-6855; Neuberger et_ai^, 1984, Nature, 
3ia:604-608; Takeda et al. . 1985, Nature, 311:452-454; U.S. 
Patent No. 4,816,567) by splicing the genes from a mouse 
antibody molecule of appropriate antigen specificity 
together with genes from a human antibody molecule of 
appropriate biological activity can be used. A chimeric 
antibody is a molecule in which different portions are 
derived from different animal species, such as those having 
a variable region derived from a murine mAb and a human 
immunoglobulin constant region. 

Alternatively, techniques described for the 
production of single chain antibodies (U.S. Patent 
4,946,778; Bird, 1988, Science 212:423-426; Huston et alt , 
1988, Proc. Natl. Acad. Sci. USA 85:5879-5883; and Ward efc 
al. . 1989, Nature 121:544-546) and for making humanized 
monoclonal antibodies (U.S. Patent No. 5,225,539, which is 
incorporated herein by reference in its entirety) can be 



utilized to produce anti-diff rentially expr ssed or anti- 
pathway g ne product antibodies. 

Antibody fragments which recognize specific epitopes 
can be generated by known techniques. For example, such 
fragments include but are not limited to: the F(ab')2 
fragments which can be produced by pepsin digestion of the 
antibody molecule and the Fab fragments which can be 
generated by reducing the disulfide bridges of the F{ab')2 
fragments. Alternatively, Fab expression libraries can be 
constructed (Huse et^, 1989, Science, 246:1275-1281) to 
allow rapid and easy identification of monoclonal Fab 
fragments with the desired specificity. 

5.7. nvr.T.. AMD AN TM^T.-BASED MODEL SYSTEMS 

Described herein are cell- and animal-based systems 
which represent reliable models for tumor progression 
disorders. These systems can be used in a variety of 
applications. For example, the cell- and animal-based model 
systems can be used to' identify differentially expressed 
genes via the paradigms described, above, in Section 
5 1.1.1. such systems can also be used to further 
characterize differentially expressed and pathway genes, as 
described, above, in Section 5.3. Such further 
characterization can, for example, indicate that a 
differentially expressed gene is a target gene, for example. 
Additionally, such assays can be utilized as part of 
screening strategies designed to identify compounds which 
are capable of preventing and/or ameliorating symptoms of 
tumor progression disorders, including those associated with 
metastatic diseases, as described, below. Thus, the animal- 
and cell-based models can be used to identify drugs, 
pharmaceuticals, therapies and interventions which can be 
effective in treating tumor progression disorders, such as, 
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for example, metastatic diseases. In addition, as described 
in detail, below, in Section 5.10.1. such animal models can 
be used to determine the LD50 and the ED50 in animal 
subjects, and such data can be used to determin the in viys 
efficacy of potential anti-tumor progression disorder 
treatments . 

5.7.1. llMTMAL-BASB P flYflTEMS 

Animal-based model systems of tumor progression 
disorders can be both non-recombinant animals as well as 
recombinant ly engineered transgenic animals. 

Non-recombinant animal models for tumor progression 
can include, for example, murine models of melanoma, 
prostate cancer and colon cancer. Such models may be 
generated, for example, by introducing tumor cells into 
syngeneic mice using techniques such as subcutaneous 
injection, tail vein injection, spleen implantation, 
intraperitoneal implantation, implantation under the renal 
capsule or orthotopic implantation (e^. , colon cancer cells 
implanted in colonic tissue or prostatic cancer cells 
implanted in prostate gland) . After an appropriate period 
of time, the tumors which result from these injections can 

be counted and analyzed. 

Among the cells which may be used for the production 
of such animal models of tumor progression are cells derived 
from the cell lines listed, above, in Section 5.1.1.1. For 
example. B16 melanoma cells (Fidler. I.J.. 1973. Nature New 
Biol. M2:l48-149). including cell variants exhibiting high 
(e.g.. B16 FIO cells) and low (e^, B16 Fl cells) 
metastatic potential may be utilized. Post-injection, 
pulmonary tumors generally develop in the mouse models. 
Thus, these animal serve as models of not only melanoma 
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tuiuor progr ssion but alBO as models o£ pulmonary 

" '""Tor the generation of animal models of colorectal 
cancers, colon cancer cells sucH as, for example, KM12C low 
TtastaUc potential, and KHUU ,.i,niy --- - - - 

(MoriKawa, K. "S8, Cancer Research ^^=1943 

L be implanted into nude mice spleens. In ^^ese cases 
the animals generally develop hepat.c tumors Thus 
animals serve as models of not only - ^^"^ 
progression but also as models of hepatic 

For the generation of animal models of prostate 
cancer tumor progression, cells derived from, for example 
the high metastatic potential prostatic ce 1 Une P -3 M 
the „o„-metastatlc cell Une DO 145 (Karmal , R... 
1,87, Anticancer Res. 7:1173-1180; Koziowskx, f^' 
1984 cancer Research 44=3522-3529, may be implanted into 
the ;rostates of animals and the resulting tumors may be 
analysed and compared to, for example, normal "«ue In 
Tch a manner, genes which are differentially expressed in 
neoplastic versus normal cells as well as versus metastatic 

cells may be identified. ^ , „ mo cene 

The role of identified gene products (4^, 030 gene 
products) can be determined by transfecting cDNAs encoding 
Zl gene products into the appropriate cell line and 
:a!ing L effect on the cells- ability " ---^ ^ 
progression in animal models such as these. The rol 
identified gene products may be further analyzed by for 
exampi; culturing cells derived from the tumors which 
develop in the animal models, introducing these cul ured 
, cell !nto animals, and subsequently measuring the level of 
Tdlitified gene product present In the resulting tumor 
clus in Lis manner, cell line variants are developed 
h'can be useful in analyzing the role of quantitative 
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and/or qualitative differences in the expression of th 
identified genes on the cells' ability to induce tu»or 
proores-ion. For example, as demonstrated, below, in the 
Example presented in Section 6, 030 gene expression is 
inversely related to the metastatic potential of the tumor 
cell line used to generate such a tumor progression animal 

Additionally, recombinant animal models exhibiting 
tumor progression characteristics and/or symptoms of tumor 
progression disorders, including metastatic diseases, can be 
utilized, for example, such well-known animal models as the 
transgenic mouse model tor human melanoma and transgenic 
mice Which carry specific mutations which result in multiple 
intestinal tumors (Mintz, M. and Silvers W.K., 1993, Proc. 
Natl. Acad. sci. OSA 2il:8817-8821; and Fodda, R., et al., 
1994, proc. Hatl. Acad. Sci. USA 21:8969-8973) . Further, 
recombinant animal models for tumor progression can be 
engineered by utilizing, for example, target gene se<r.ences 
such as those described, above, in Section 5.4, in 
conjunction with techniq.es for producing transgenic animals 
that are well Known to those of skill in the art. For 
example, target gene sequences can be '"J 
overexpressed in, the genome of the animal of interest, or, 
if endogenous target gene sequences are present, they can 
either L overexpressed or, alternatively, can be disrupted 
in order to underexpress or inactivate target gene 

''^""iT'order to overexpress a target gene sequence, the 
coding portion of the target gene sequence can be ligated to 
a regulatory sequence which is capable of driving gene _ 
expression in the animal and cell type of f * 

regulatory regions will be well Known to those of sKUl in 
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«. art. and can be utUl.ed in the absence of undue 
e.peri»entat^. ,„,erexpres3lon of an endogenous 

in order sequence can be introduced into 

target gene sequence ^^^ ^^^^ the 
the gence of the f f inactivated, 

endogenous ""^^J^/J^^^^^^^^^^ „.p.ising at least part 
preferably, an engineered qu introduced, 

°' nirge::: Tcrirtre oogenous target sequence 

rdrs^tr/r---- r^rtt^^^^^^^^^^^^^^^^^ '^"^ 

sequence into the animal's genome. Gene targ 

"=="'":;mr:f an;ter:Xiu-n^ - - — 

. « rabbits, guinea pigs, piqs, micro-pigs, 

to, mice, rats, rabbits gu monkeys, and 

™?rei:rto e;e^ animal models of tumor 
rumrprogrLion disorders, such as, for 

Any tecnniqu . animals to produce 

introduce a -'fjf^:: r animals, such techniques 
"rtt : e n H: tld to pronucuar microinjection 
include, but are no^ „ „ loaq n S. Pat. No. 
,Hoppe, P.O. and ^^'^^'^ 'J^'^-J'^:^^ "^l.er into germ 
,,873,191); retrovirus -^""^^^ ,,ed. Sci. , 

lines (Van der Putten „ ^^,^„„i, 3tem cells 

US. rn::;,..!,; electroporatlon of 

1^^^. -U. Mol. .= -3-lS.. an. sp.rm^-_ 
^ «^nA transfer (Lavitrano et_aU, l^ay, — 
mediated ^^^"f ...j, techniques, see Gordon, 

, 723); etc. For a review 

1989, Transgenic Animals, Intl. ^-J'^ entirety. 
Which is incorporated by reference herein 
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Th present invention provides for transgenic 
animals that carry the transgene in all their cells, as well 
as animals which carry the transgene in some, but not all 
their cells, i^, mosaic animals. The transgene can be 
integrated, either as a single transgene or in concatamers, 
e.g. . head-to-head tandems or head-to-tail tandems. The 
tranlgene can also be selectively introduced into and 
activated in a particular cell type by following, for 
example, the teaching of Lasko et_aU (Lasko, M. , et^aU, 
1992, Proc. Natl. Acad. Sci. USA 89:6232-6236) . The 
regulatory sequences required for such a cell-type specific 
activation will depend upon the particular cell type of 
interest, and will be apparent to those of skill in the art. 

When it is desired that the target gene transgene be 
integrated into the chromosomal site of the endogenous 
target gene, gene targeting is preferred. Briefly, when 
such a technique is to be utilized, vectors containing some 
nucleotide sequences homologous to the endogenous target 
gene of interest are designed for the purpose of 
integrating, via homologous recombination with chromosomal 
sequences, into and disrupting the function of, the 
nucleotide sequence of the endogenous target gene. The 
transgene can also be selectively introduced into a 
particular cell type, thus inactivating the endogenous gene 
of interest in only that cell type, by following, for 
example, the teaching of Gu st_ai^ (Gu, H. , e£_ai^, 1994. 
science ifiS: 103-106) . The regulatory sequences required for 
such a cell-type specific inactivation will depend upon the 
particular cell type of interest, and will be apparent to 

those of skill in the art. 

once transgenic animals have been generated, the 
expression of the recombinant target gene and protein can be 
assayed utilizing standard techniques. Initial screening can 
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o« the transgene in V/lch include, but are 

can also be assessed """^^ of tissue samples 

„ot XiBited to, northern ^'^^ ^^^i^i^'-^on analysis. 

°"'r rredTcrTa tft-U ,ene-e^ressi„, 

ana RT-c-Pl'^^'^^; J^^^^^ i^unooytoche-ically usxng 
tissue, can also be eva i„terest. 
antibodies specif le for tne m ^ that express 

The target gene transgenic animals ^ J 

=«i or target gene transgene peptide (oete 
target ";73t„;,„tibodies directed against 

i^unocytochsmically, usxng ..^ectable levels 

target gene ^''^^^^^'ZZte. to identify those animals 
should then be further evai characteristics. 

Which display tumor ^'"^"/"fj^f/s^^^^toms. Such tumor 
including tumor progression disorder ^ymp 

p.c,ression disorder --^^^^^^^^^^^^^ ^"h tumor cells 

include, for example, those asso ^^h 
as found in human melanoma breast g ^^^.^^ 

, types of skin cancers. 

Mditionally, ^^^^^^^ ^^^^ pnenotypes 
transgenic animals can be -"^^V^d f 
Characteristic of tumor progression „^ 
phenotype. can include. -"f;;/,:!^ ,iven tumor 

0 expression characteristic °^ ^^'^^'^^ 

.egression --J-^Ts IsII^'a particular cell 
reX:r:inrern of expression and its comparison to 



known fingerprint expression profiles of the particular cell 
type in animals exhibiting tmnor progression. Such 
transgenic animals serve as suitable model systems for tumor 
progression disorders. 

Once target gene transgenic founder animals are 
produced ( i.e. , those animals which express target gene 
proteins in cells or tissues of interest, and which, 
preferably, exhibit tumor progression characteristics) , they 
can be bred, inbred, outbred, or crossbred to produce 
colonies of the particular animal. Examples of such 
breeding strategies include but are not limited to: 
outbreeding of founder animals with more than one 
integration site in order to establish separate lines; 
inbreeding of separate lines in order to produce compound 
target gene transgenics that express the target gene 
transgene of interest at higher levels because of the 
effects of additive expression of each target gene 
transgene; crossing of heterozygous transgenic animals to 
produce animals homozygous for a given integration site in 
order to both augment expression and eliminate the possible 
need for screening of animals by DNA analysis; crossing of 
separate homozygous lines to produce compound heterozygous 
or homozygous lines; breeding animals to different inbred 
genetic backgrounds so as to examine effects of modifying 
alleles on expression of the target gene transgene and the 
development of symptoms for tumor progression disorders. 
One such approach is to cross the target gene transgenic 
founder animals with a wild type strain to produce an Fl 
generation that exhibits symptoms for tumor progression 
disorders. The Fl generation can then be inbred in order to 
develop a homozygous line, if it is found that homozygous 
target gene transgenic animals are viable. 
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5.7.2. rBT.T.-BAflE P A88AY8 
cells that contain and express target gene sequences 
Which encode target gene protein, and, further, exhibit 
cellular phenotypes associated with tumor progression 
disorders, can be utilized to identify compounds that 
exhibit an ability to prevent and/or ameliorate tumor 
progression. Cellular phenotypes which can indicate an 
ability to ameliorate symptoms of tumor progression 
disorders can include, for example, tumor cells with low or 

hiah metastatic potential. 

Further, the fingerprint pattern of gene e^tpression 
of cells of interest can be analysed and compared to the 
normal fingerprint pattern. Those compounds which cause 
cells exhibiting cellular phenotypes of tumor progression 
disorders, including metastatic diseases, to produce a 
fingerprint pattern more closely resembling a normal 
fingerprint pattern for the cell of interest can be 
considered candidates for further testing regarding an 
ability to ameliorate the symptoms of such diseases. 

cells which will be utilized for such assays can, 
for example, include non-recombinant cell lines, -"^"^ 
b" not limited to. melanoma Fl and B16 F10 cell 

lines), human colon (fi^, KM12C and KM20L4 cell lines) 
prostiU DU 145 and PC-3-M cell lines) and breast 

cancer cell lines (s^, MCF-7 and MDA-MB-435 cell 1 nes) . 
in addition, purified primary or secondary tumor cells 
derived from either transgenic or non-transgenic tumor cells 

"^«er, cells which can be used for such assays can 
also include recombinant, transgenic cell lines. For 
example, the metastatic disease animal models of the 
"venti;n, discussed, above, in Section 5.2.4.1, 
to generate cell lines, containing on. or more cell types 

- 70 - 



involv d in metastatic diseases, that can be used as cell 
cultur mod Is for these disorders. While primary cultures 
derived from the metastasis in transg nic animals of the 
invention can be utilized, the generation of continuous cell 
lines is preferred. For examples of techniques which can be 
used to derive a continuous cell line from the transgenic 
animals, see Small et al> . 1985, Mol. Cell Biol. 5:642-648. 

Alternatively, cells of a cell type known to be 
involved in metastatic diseases can be transfected with 
sequences capable of increasing or decreasing the amount of 
target gene expression within the cell. For example, target 
gene sequences can be introduced into, and over expressed 
in, the genome of the cell of interest, or, if endogenous 
target gene sequences are present, they can either be 
overexpressed or, alternatively, be disrupted in order to 
underexpress or inactivate target gene expression. 

In order to overexpress a target gene sequence, the 
coding portion of the target gene sequence can be ligated to 
a regulatory sequence which is capable of driving gene 
expression in the cell type of interest. Such regulatory 
regions will be well known to those of skill in the art, and 
can be utilized in the absence of undue experimentation. 

For under expression of an endogenous target gene 
sequence, such a sequence can be isolated and engineered 
such that when reintroduced into the genome of the cell type 
of interest, the endogenous target gene alleles will be 
inactivated. Preferably, the engineered target gene 
sequence is introduced via gene targeting such that the 
endogenous target sequence is disrupted upon integration of 
the engineered target gene sequence into the cell's genome- 
Gene targeting is discussed, above, in Section 5.7.1. 

Transfection of target gene sequence nucleic acid 
can be accomplished by utilizing standard techniques. See, 
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"r"°^d for trpresence of reco^lnant target gene 
^A, and for the p „^g„in a decrea« in 

protein P"^-"""" ".^"frired, standard techniques can 
target gene expression is desxr , ,„d„,enous 

.,<ied to demonstrate whether a decree 
Target gene expression and/or in target gene product 
production is achieved. 

^oiiowin, assays are designed to identify 
compounds that .ind to ^X':^::^:^^ product, 
cellular ^"^^"'^ ^^,1, the interaction of the 

and to compounds 'hat intert p„teins. 
target gene product with other cellala P 

such compounds can ^'^^^ „n 

other cellular P""/- ^/^t ^^^^^i^^^^^ L, for 
Include, but are not limited to, p P 

, example, soluble peptides, -=^f -'^j^^^J^l, potions of 
tailed fusion peptides, comprising extraceU P 

transmembrane receptors, ana in«»u« 
target gene product ^ransme ^^^^ ^^^^ ^^^^^ 

random peptide libraries (see, s^, ^ 
Mature 354:82-84; Houghton, R. et_aU, i^yi, 
, "t'of cand/or --^---re: To 

antibodies (including -^^.f^;^,"^ : '^l^Lic or single 

,0 monoclonal, '>'»-"^*' th'^and FAb expression Ubary 

chain antibodies, and FAb, F(ab an 
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nt., ana epUope-Mn.in, ..a^en.s tn« o« . a„a s^U 

organic or ^^"''^ ^l^^^^^i, „says sue. as those 

co^ounds .d n"''-;^ ;; ^^^p^,, i„ elaborating 

aesoribea herein can be p,^,ot, and for 

the biological function of the "'' J j„ instances, 

r::^1 "grslon state or disorder 

rsutr^o: :Towe: o.erall -1 ^ rrge'tTene product 
egression, tar^ pr^sion state 

activity in a "l^J;"^"^ „ith the target gene 

or disorder, compounds that in ^ ^he 

product can include ones which compound. 
Ltivity Of the bound ^^^^^^^^^^ xevel of 
«ould bring about a" ^ ve^^^^^^^_^^ ^^^^^^ 

target gene activity, tn ^^^^ instances whereby 

tuMor P'"'"::^"" "'/".tret gene ca^se aberrant target gene 
mutations within the target g effect that 

proteins to be made -^J^^^rn'tLt bind target gene 
.ead. to tumo^ ^^-^^ Li,i. the activity of the 

protein can be identlli ^^^^^^^ 
bound target gen. protein. J example, 
effectiveness section 

rdrssirrior in section 3.3... 

^ ^ systems can ^^^^^^ ^T^''^' ^ 
compound, capable of "^"''i"' ^/Z^';! :3eful, for example. 

,0 in modulating J„„„„t target gene proteins, can 

Tusffririubinrng^he biological function of the 
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rrrrr rti^vL .... s.. 

^"""''tnrinc.p.e ..e u- ^ ^^^^^^^ 

preparing a reaction ^ j„ » time srf£i=ient 

L test co»pcuna under ^""^^^tt and bind, tnus 

to allov the two ".pcnents to ^^^^^ ^^^^^^^ 

forming a complex ""^^ ''^„„, be conducted in a 
the reaction mixture. These a y ^^^^^ ^^^^ 

variety of ways. For °" product or the 

.3say wouxd invoive " tnd'detectin, target gene 

test substance onto a solid P ^^^^^ ^^.^ 

product/test compound complexes ^^^^^^^ 3^,^ , 

,t the end of the paction. In o ^ ^^^^^ 

„,thod. - nr^^^ :Mch is not anchored, can 

surface, and the = * indirectly. 

be labeled, either '"".^""^ ^,^,3 „n conveniently be 

X„ ^""""V rlase T^e anchored component can be 
utilized as the solid phase. ^^^^^^^^ .^^c^ents. Hon- 

immobilized by """'"^f ^"\^„„pUshed by simply coating 
covalent '--^f^''l;^\'i:^:Z of the protein and drying, 
the solid surface «^ » preferably a 

5 Alternatively, an immobilized ^ 

ir;srt c^r the Uein to the .olid 
rale'lhrL^rs can be prepared in advance and 

Stored. . - vbe assay, the nonimmobilized 

in order to conduct the assay, . . the 

'° V is added to the coated surface containing 

component is addea t: reaction is complete, 

anchored co^apon-t- Af ^^^..^^^ ,„,er 

unreacted components are r 
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conditions such that any complexes formed will remain 
immobilized on the solid surface. The detection <>' 
complexes anchored on the solid surface can be «"mpUsh«. 
l„ I number of ways, where the previously immobilized 
Iponent is pre-labeled, the detection of ^^^^^^^ f 
on the surface indicates that complexes were formed. Where 
the previously nonlmmobillzed component is not pre-labeled, 
Z indirect label can be used to detect complexes anchored 

1 surface, ^. usin, a labeled antibody spec^ - Jor 
the immobilized component (the antibody, in turn can be 
Erectly labeled or indirectly labeled with a labeled antl- 

ig antibody). i„ , 

Alternatively, a reaction can be conducted in a 
IK^id phase, the reaction products separated from unreacted 
components, and complexes detected, 9^, using an^ 
immunized antibody specific for target gene or the test 
compound to anchor any complexes formed in solution and a 
labeled antibody specific for the other component of the 
possible complex to detect anchored complexes. 

. . « >ao>VB TOK CBLLOLMt PKOIBIIIS THAT 



25 



Any method suitable for detecting protein-protein 
interactions can be employed for identifying novel target 
^roduct-cellular or extracellular protein interactions 
Lse methods are outlined in section 5.1.3., for the 

Identification of pathway genes, and can utilized herein 
with respect to the identification "^.P-*"- 
interact with identified target proteins. In such a case, 
the target gene serves as the known "bait- gene. 
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Tn. t«9.t ,ene products o^^^^^^^ ,,tra=.llular 
interact with one or »or ^cromoleculee 
^;.„xecu.ee. sucH -X^/ :^-„„,,eic acid »oXecules an. 
include, .ut are f/,, such as those 

.hos. products "-"^ 7 the purposes of 

described, above, in extracellular 

discussion. - .s "bindin, partner. 

^croBolecules are referred * ^ useful xn 

compounds that disrupt such in 

r^latin, the ^^^^^^''/^ ^protts. Such compounds can 

Jlude, but are not ..scribed in Section 

antibodies, peptides, 

5.3.1. above. „„y systems used 

™' mt intrr'ere with the interaction 

identify compounds that l cellular or 

, ^tween the tar,et ,ene product „, 
extracellular -^^'"'^ ^^^^ t Jtar,et ,ene product, and 
, reaction mixture c"""^""' ..g „d for a time 
thrbindin, partner ^ f^^. .c interact and bind, 

sufficient to alio- the two P ^ ^'^'Z the 

formin, a complex- m or ^^^^^ , 
inhibitory activity, .^.p^und. The 

pre.«.c. and ^included in the reaction mixture, 

compound can b. initiaUY addition of 

„r can b. added at a J^^t^cellular binding 

complexes between the targ 
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„ e>.«c x.u.« .in..n, z . 

formation of a eompUx in « impound, Indicate, 

tn -ction -7;;;/:!^ ... interaction of tne 

that the compound i""" interactive binding partner, 
target ,ene product -^^^ ;7„"„,,,i„ „action mixture. 

containing th. test co P formation within reaction 

can also he compared to -P^- „„,„t target gene 

fixtures containing the test J ^,,3. ca.es 

product. Thi. comparison ^ a. that disrupt 

Lrein it is " 1^ target gene products, 

interactions of mutant but "^^J' ^terfere with the 

«,e assay for compound. «at 1 ^ ^^^^^^ ^^^^^^^^ 

interaction of the ^J-^^^^^j;^ „ ,o„ogeneou. format, 
can be -"^-"^/";;;:r;e\nchoring either the target 
Heterogeneous ""1" „ onto a solid phase and 

gene product or the >=^"^1"' /""y,, „ud phase at the end 
detecting complexes a-*-""* f^^^^^,, the entire reaction 
cf the reaction. In homog^""/ * i/. ther approach, the 
0 is carried out in a ^^'^^^^^'^^ ,„ied to obtain 

crder of addition of "^^'^^^^^^^^s being tested. For 
different information ^'""f J*^ ^^th the interaction 

example, test compounds that in ^^^^^^ 
between the target gen. ^^^^ conducting the 

^. hy competition, ^^^'fe /t .uJtance; i^. 
taction in the presence of th ^„ 

adding th. "•-;'*;;^"::;:.^:t gen. product and 
or .imultaneouely **,t,i„i„ binding partner, 

interactive cellular or ^^"^'^ preformed 
Alternatively, test c^P/^f^^^^l, binding constants that 
complexes, ^ compounds with high ^ 

displace one Of ---- ^ „ ^to the reaction mixture 
tested by adding the test co p 
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A The various formats are 
.„er ccBplexe. have ^e„ fo^.. The 

described brietly below targ t 

in a heteros neous assay J extracellular 
prcuc. or tne -^^-rrsoU. surface, while the 
.mam, partner, is a-"";^ directly or 

„„n.an=hored -P-^^^^ '^"^f^^Utre plates are conveniently 
indirectly, m °^Ies can be iB^obilUed by non- 

utilUed. The anchored species ^ ^^^^„^ attachment 

Tovalent or covalent .oUd surface with 

be accomplished s Imply ^^^J partner and 

a solution of the target P ^^^^.t,,, ,or 

dryln,. alternatively, an immobU ^^^^^^ ^^^,3 

J species to be anchore* an^ ^ in 

to the solid surface. The 
i advance and stored. partner of tM 

in order to conduct th ^ith or 

l^oblUzed specie. Is e^osed to ^^^^ 
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in otv*^- — ^ *.jje coatea 

l^oblUzed specie. Is e^osed to 

without the test -P--^:::: by washln,. and any 

unreacted remain lm»oblll«-i on the solid 

complexes formed will "^^"J „ anchored on the solid 
surface. The "/^.Ta number of ways, vmere the 

surface can be «~"^^^^"*;;,.ia.eled. the detection of 
non-lmmoblU«a indicates that complies 

label ImmobllUed "^^^^^l^^iiLed species Is not pre 
«ere formed, where the no" 1 ^^^^^^ complexes 

labeled, an Indirect label «n b ^ ^^^^^ ^ 
anchored on the »"*--^^„;.i„oblll«d species (the 
specific tor the l"^'""" "^^^.i, labeled or indirectly 

labeled with a labeie components, test co p 

°';irx "o-": or which disrupt preformed 
which inhibit complex to 
complexes can be detected. 

- 78 - 
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Uternativ^ly. the ^^'^^'^Z^ot'tT^^- 

coBpound, the ^. u=ln, an 

components, and "^^^'"^ one o£ the binding 

i^obilized antibody -P-^"° i„ solution, and a 

ooBponents to — ^^^^/^^^/tL other partner to detect 
labeled antibody ^ ,i„, upon the order of 

anchored complexes. Again P ^^^^ compounds 

''TZZ or w^lch Tlsrupt preformed complexes 
which Inhibit complex or 

can be iO-^^^^^J^ embodiment of the invention, a 

in an alternate e ^^^^ approach, a 

nomogeneous assay can be u ^^^^^^ 
preformed complex of ^"'^ J ^^„i„ binding partner 
interactive cellular or -^^'^^ ,„,et gene products 

product is prepared in "^^^,e,, the signal 

or their binding partners are ^ j^^,lc„ 

generated by the label is ^--^^f ,„,enstein which 

.a U.S. patent No. 4,109," i addition of a 

(see, s^a... for immunoassays). The ao 

0 utili.es this •PP"''^'^^'" , and displaces one of the 

test substance that ""P'"^ result in the 

the preformed complex wi , test 
species from the p taokground. m this way, 

generation of a signal above cduct-celWlar or 

Lbstances which ^^^"^^^"nX/acUon can be identified. 
,5 extracellular binding P«»"« target gene product 

in a particular e"*"*";"'' recombinant 
can b. prepared for ^^"^^'^^^fZ ^^. Por example, 
technigue. described in "ction ^"^^ . ^ ,i,,athione- 

target gene coding «ctor such as pOEX- 



using methods routinely 
« r,ls . .on '7^:::^,i;;;iv.. in section 5.2.4. 

practiced in the art -^J-"; „aloactive isotope 

^is antibody can "« 1;'^^; p„ctic d in the 

i25i, for example r by nez.no cST-Target gene 

in a heterogeneous ^^;„,hione-a,arose beads, 

fusion product can be binding partner 

.ne interactive "J^^rp^esence or absence of the 

product can then be added ^" ^ ,„«raction and 
,est compound in a Banner that 

binding to occur. At the ^^^^^^ 
unbound material can be ;7;^,, and allowed 

monoclonal antibody can be added ^„„„„ti„„ 
to bind to the coBple.ed "^^^ interactive cellular 

between the target gene P^^-^/fJ^ ,,,e=ted by 
, or extracellular binding P^ ^f ,,,, 

measuring the amount -\'f'^^'Zros. beads. A successful 
associated with the ^l""f^^°;f ^ test compound will 
inhibition Of the Tadioactivity. 
result in a decrease - -"^^^ £,.ion product 

Alternatively the 

and the interactive cellular ^^^^^ 
partner product "^^t^, J.a,arose beads. The test 

absence of the '"^^V^hlr luring or after the binding 
compound can be added J^/"^^,, .^^^e can then be 

partners are ^""-^.^'/^re" beads and unbound Material 
added to the ^^-^^"^^'"^^^^^^^ Inbibition of the 
1, washed away. Again the eKt ^^^^^^ 

binding partner i"""'""" associated 
labeled antibody and measuring the 

30 with the beads. the invention, these same 

in another embodiment oi fragments that 

technigues can be '^^^^'^^^^s T^ tarfet gene product 
correspond to the binding domains of 



plac ct on. or both of the tul ^ 

„^er of '^^fi:;^:Z protein's Mndln, site. These 
" identify and -"^^^/^^ to. mutagenesis of one 
methods include, but products and screening for 

the ,e„es enccd.n, ;'J^,^,^^,,,,.^,on assay. 

disruption of binding in a co 

compensating mutations in 9 

species in the ""^J;" ""^^^p,,,,,, products «iU reveal 
of the genes encoding the r P ^^^^^^ 

the mutations that " ^ternatively, one product 

involved in l"""="^\td s :'ace using methods described 

can be anchored to a solid surface ^^^^^^^ 

in this section above, and alio ^^^^^^ 

,o its labeled ""-^"'/"t/;;^ fn. After -ashing, a 
a proteolytic enzyme, such t yp ^^^^^^ 

sh'ort. labeled peptide ^^^^^^Z;,:,, ii.n can be 

remain -''I'^^J^;^:^ ali^o acid seguencing. Also, 
a isolated and identified y extracellular 

once the gene coding for tn segments can 

binding partner product » "f^^^nts of the product, 
- engineered to egress P-* - purified 
Which can then be tested tor 
25 or synthesized. 

^ Of the -rr'i " rrr 

limited to, compounds such as those^^^^ ^^^^^^ 
30 foregoing assay ^^^^^jj''^^^,,, of tumor progression and 

^"^^r^rsir isorersHncluding metastatic di 

tumor progression 



and/or ai. 11="" t: 

«ni=h f systems can be .xpo". to a P 

For example/ sucn v,4Htv to ameliorate ^v^u 

progression symptoms, at a ,„,aoration in the 

a tl»e sufficient to ell"' s» examined to 

"exposed ceus. Mter ej--; ^,^„3sion state or 
determine whether one or ^„ teen altered to 

Lor P--rno"r:i-tVPe, non-n.^^^^^^^^ 
resemble a more norm 

disease phenotype. progression l"^"!""' 

Taxing, as an „ ^he highly metastatic 

. <. cell-based systems sucn a o-on exposure 

metastasis, cell utilised. DP" 

Bi6 no melanoma cell ime j„ their 

„ such cell systems, ^^Ic ^tential of such cells- 

ahillty to reduce the -'^^^^^^^^J^.^^ion within these cells 
Purther, the level of 030 gen ^ , 
- be --ea - 1 ,„.,cate an amelioration 

level of 030 gene expre . gtate. 

of the metastatic ^"^^^^^eVems, such as those 

" I„ addition, animal-based J ^ ^^^^^.j, 

- described, above, in ^j^ptoms of tumor 

compounds capable of -"^^"f^^f ,e used as test 
progression. Such ^-"^ "^^Uon of drugs, pharmaceuticals, 

^ T'e^rtra ;uspected to e.lblt an 

xnodels can be expos 
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Buftlol nt concentration response 
such an a.elloration xn ^-J^'^ be monitorec. by assessing 
,t the aniBals to the f ,„o, progression, 
eh. reversal of <'^-^7^;„":: ttat»ents which reverse 
With regard to ^"""f;;;;„::'progression. such as. tor 
any aspect of sy.pto«s of ^^^'J j hould be 

„ discussed in Section ^e utilised to 

^Ti\rora"cr;rdC.eiiorate sy.pto»s of 
assess the ability of a c P disorders. For 

tusor progression and ^ZZ or a fingerprint 
example, fingerprint '"/^^'^'^/^rassess.ent. Fingerprint 
:re;rsr ri:U:rnt patterns are described, 

below, in section 5.11. characterized for known 

fingerprint l^l^-^i:^^.:..^^^^^ , neoplastic or 
states or known P animal-based 

metastatic states, within the ce and/^ ^^^^^^ 

.odel systems, ^-''-^f^^^^^^^^ain the effect a test 
patterns can be compared to asce^^_^^ 

i5 compound has to modify ' ^^^^^ that of a more 
cause the pattern to more closely 

desirable fingerprint J^";" . , oompound can cause 

Por example, disease model system 

the fingerprint pattern of -^^^ '^^ ..stem. 

30 to more closely resemble a , cause the 

Administration of a compound can ai ^^^^^ 

eingerprint Pattern of a ;^„^„,,,,io di tates. 

..v-orfT-essLon states, sucn 
tiimor progression 
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Mcnltorlng the °' ^/^slc dru, screening, 

p,cre«ion can be »PP^^;^;°\°;/„,, ,Unlcal triaU, the 
U auo in cUnlcal "^^^^ ,,,, been discovered .n 

expression of a panel of ^-/^ ^„ ^..^ion 5.1.1.1 can 

one o. the P-^-^ re P".— °' ' 

be used as a "read out 

paradi^ describin, the B^-^ ^o, that are 

Identification of ""^"f^^ror clus. .or exa.pXe, - a 
down-regulated in -"=;f;;^^„ isolated fro- the pri-ar, 
clinical trial. tuMor ^„,„ed and analysed by 

,„ors removed by -^^^^^//^tbed in Section 6.1. The 
aif f erential display as ,enes can be 

levels of expression of the f ing P .^^^.^ea 
^antified by Horthern measuring the amount of 

fn section ..1. - J,ods described in Section 

, protein produced, by one °^ «^ .^les can serve as 

I,,,, xn this way. ^^J^^;Z the metastatic potential 
putative bio.arKe« indi at ^^^^^^^^^^ 

of the tumor cell. suitable 
eU-- - "-""/jrdr, ct: be developed based on 
,3 che.other,peutic antican- dr^.^ ^^^^^ ^ ^'ttve 
the metastatic P<'"""f^^^^^.^ disease, patients can have 
cases of inoperable -atastatic d expression so 

riopsie. removed - -^rrmrasured by monitorin, the 

^"'^ TofTs Ir X«ssion of romy030. 
30 degree of resTio 
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iESMHB!tj*^f^;i:;;;n;ions Which 

cscrlbea herein p,o,r sslon and 

usea .»eU°»" Mrvla, £i"t, target 

rJaerrtnvoXvin, tu.or progress on v. , ^^^^^ 

.oauution. : ■ ;„,et ,ene .ovulation can 

Involved m tumor P^'^f^^ „,,„e, depending on the 
be of a positive or negative n ^^^i^tory event 

rrriti: rc; 

fields a net resul ^ 
meliorated. , ,5 used herein, ret 

..negative modulation, ^^get gene 

■ 1 the level and/or ^"""^"^^^^..^^ of the target 
eduction in the ^^^^^ ^^^^^^ ^^^,vity 

product ^^3,„ee of the modulatory tr 

' -Clt^: - rrg'et'tne product 

---:rereraX--r---^"^"^^ 

-rtren:: of r::-lon can he .ought 

.ovulatory ^-f- t^ger^en. product activity levels 
expression leve ^..-slon 
alscussed m section 5.9.1 ^^^^ ^^^^ p,ogre«^ 

Alternatively, it P° . the absence or 

« about, at least in part Y ^^a^ctlon in 

30 can be f ,ene expression or a r 

.eduction Of the i .s uch^^ 

the level of 9 expression and/ 

increase in the 1 



,^ brlnq about th. amelioration of 
o£ such gen products would br.n, 

tumor progression ^^^^"'^.^^^^ i„ the Example 

Por example, as demonstra ^^^^^^^ ^^^^^ 

presented in Section 6, bel . ^^^^ ^ ^^^^^^ metastatic 
5 030 gene expression corre ^^^^^^ modulatory 

tumor progression state ^^„ssion in cells 

technique which increased 030 ge .^.^i^, 

within a highly ^7;^ Symptoms of such a state, 

therefore, act ^^^^^^^^^^^^^^ .^,bit general 
,0 Further, because the 030 g P ^^^^^^ ^ 

ri^irmrilirrhnlgue cou. ameliorate symptoms of 

„.„y tumor P'.°^"^::,;:;"te;,„i,.es for increasing the 
positive „ ,„get gene product 

15 target gene 'J^'ed in Section 5.9.2, below. 

z:zz rrrr.;i ...... - 

20 5.9.3. „,„aression events which may be 

Mong the tumor progressi^ ^^^^ 

treated are those " ^^,„ melanomas, breast, 

tumors may --^^ ^.eal. stomach, colon, bowel, 

,astrointestinal. prostate, bladder, 

„ colorectal and "^^^^f/^f/, 'oervical, brain, lung, 

testicular, ovarian, uterin , thyroid, 

bronchial, larynx, jj^^, types of skin 

bone, leuxemias, lymphomas and 



cancers . 
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5.9.1. MBGATIVB MQDTTIATORY TBCHNIQUE8 
AS discussed, above, successful treatment of tumor 
progression symptoms and of disorders involving tumor 
progression can be brought about by techniques which serve 
to inhibit the expression or activity of target gene 
products . 

For example, compounds such as those identified 
through assays described, above, in Section 5.8, which 
exhibit negative modulatory activity, can be used in 
accordance with the invention to prevent and/or ameliorate 
symptoms of tumor progression, including tumor progression 
involving metastatic disorders. As discussed in Section 
5.8., above, such molecules can include, but are not limited 
to peptides, phosphopeptides, small organic or inorganic 
molecules, or antibodies (including, for example, 
polyclonal, monoclonal, humanized, anti-idiotypic, chimeric 
or single chain antibodies, and FAb, F(ab')2 and FAb 
expression library fragments, and epitope-binding fragments 
thereof) . Negative modulatory techniques involving antibody 
administration are described, below, in Section 5.9.1.2. 
Techniques for the determination and administration of such 
compounds are described, below, in Section 5.10. 

Further, antisense and ribozyme molecules which 
inhibit expression of the target gene can also be used in 
accordance with the invention to reduce the level of target 
gene expression, thus effectively reducing the level of 
target gene activity. Still further, triple helix molecules 
can be utilized in reducing the level of target gene 
activity, such techniques are described, below, in Section 

5.9.1.1. 
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e a 1 1 NEGATIVE MODULATORY AMTISENSB, 

g^gn^CS TP---' "^T.TT APPROACHES 

Among the compounds which can exhibit the ability to 
prevent and/or ameliorate symptoms of tumor progression are 
antisense, ribozyme, and triple helix molecules. Such 
molecules can be designed to reduce or inhibit either wxld 
type, or if appropriate, mutant target gene activity. 
Techniques for the production and use of such molecules are 
well knovm to those of skill in the art. 

Anti-sense RNA and DNA molecules act to directly 
block the translation of mRNA by hybridizing to targeted 
mSNA and preventing protein translation. With respect to 
antisense DNA, oligodeoxyribonucleotides derived from the 
translation initiation site, between the -10 and .10 

regions of the Target gene nucleotide sequence of interest, 

are preferred. 

Bibozymes are enzymatic RHA molecules capable of 
catalyzing the specific cleavage of ENA. (For a review 
see, for example, Rossi, J., 1994, Current Biology 4:469- 
471) The mechanism of rlbozyme action Involves sequence 
specific hybridization of the rlbozyme molecule to 
complementary target ESA, followed by a endonucleolytic 
cleavage. The composition of rlbozyme molecules must 
include one or more sequences complementary to the target 
gene mEMA and must Include the well-known catalytic sequence 
responsible for mEHA cleavage. For this sequence, see U.S. 
pat. NO. 5,093,246, which is incorporated by 
herein in its entirety. As such within the scope of the 
invention are engineered hammerhead motif rlbozyme molecules 
that specifically and efficiently catalyze endonucleolytic 
cleavage of ENA sequences encoding target gene proteins. 

specific rlbozyme cleavage sites within any 
potential RNA target are initially identified by scanning 
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the mol cule of interest for ribozyme cleavage sites which 
include the following sequences, GUA, GUU, and GUC. Once 
identified, short RNA sequences of between 15 and 20 
ribonucleotides corresponding to the region of the target 
5 gene containing the cleavage site can be evaluated for 

predicted structural features, such as secondary structure, 
that can render the oligonucleotide sequence unsuitable. 
The suitability of candidate sequences can also be evaluated 
by testing their accessibility to hybridization with 
10 complementary oligonucleotides, using ribonuclease 

protection assays. 

Nucleic acid molecules to be used in triplex helix 
formation for the inhibition of transcription should be 
single stranded and composed of deoxynucleotides. The base 
15 composition of these oligonucleotides must be designed to 
promote triple helix formation via Hoogsteen base pairing 
rules, which generally require sizeable stretches of either 
purines or pyrimidines to be present on one strand of a 
duplex. Nucleotide sequences can be pyrimidine-based, which 
20 will result in TAT and CGC+ triplets across the three 
associated strands of the resulting triple helix. The 
pyrimidine-rich molecules provide base complementarily to a 
purine-rich region of a single strand of the duplex in a 
parallel orientation to that strand. In addition, nucleic 
25 acid molecules can be chosen that are purine-rich, for 

example, contain a stretch of G residues. These molecules 
will form a triple helix with a DNA duplex that is rich in 
GC paris, in which the majority of the purine residues are 
located on a single strand of the targeted duplex, resulting 
30 in GGC triplets across the three strands in the triplex. 

Alternatively, the potential sequences that can be 
targeted for triple helix formation can be increased by 
creating a so called "switchback" nucleic acid molecule. 
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Switchbaclc molecules are synthesized in an alternating 5'- 
3' , 3'-5' manner, such that they base pair with first one 
strand of a duplex and then the other, eliminating the 
necessity for a sizeable stretch of either purines or 
pyrimidines to be present on one strand of a duplex. 

In instances wherein the antisense, ribozyme, and/or 
triple helix molecules described herein are utilized to 
reduce or inhibit mutant gene expression, it is possible 
that the technique utilized can also efficiently reduce or 
inhibit the transcription (triple helix) and/or translation 
(antisense, ribozyme) of mRNA produced by normal target gene 
alleles such that the possibility can arise wherein the 
concentration of normal target gene product present can be 
lower than is necessary for a normal phenotype. In such 
cases, to ensure that substantially normal levels of target 
gene activity are maintained, nucleic acid molecules that 
encode and express target gene polypeptides exhibiting 
normal target gene activity can be introduced into cells via 
gene therapy methods such as those described, below, in 
section 5.9.2 that do not contain sequences susceptible to 
whatever antisense, ribozyme, or triple helix treatments are 
being utilized. Alternatively, in instances whereby the 
target gene encodes an extracellular protein, it can be 
preferable to coadminister normal target gene protein into 
the cell or tissue in order to maintain the requisite level 
of cellular or tissue target gene activity. 

Anti-sense RNA and DNA, ribozyme and triple helix 
molecules of the invention can be prepared by any method 
known in the art for the synthesis of DNA and RNA molecules. 
These include techniques for chemically synthesizing 
oligodeoxyribonucleotides and oligoribonucleotides well 
known in the art such as, for example, solid phase 
phosphoramidite chemical synthesis. ' Alternatively, RNA 
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molecules can be generated by in vjtyQ and in viyfi 
transcription of DNA sequences encoding the antisense RNA 
molecule. Such DNA sequences can be incorporated into a 
wide variety of vectors which incorporate suitable RNA 
polymerase promoters such as the T7 or SP6 polymerase 
promoters. Alternatively, antisense cDNA constructs that 
synthesize antisense RNA constitutively or inducibly, 
depending on the promoter used, can be introduced stably 

into cell lines. 

Various well-known modifications to the DNA 
molecules can be introduced as a means of increasing 
intracellular stability and half -life. Possible 
modifications include but are not limited to the addition of 
flanking sequences of ribo- or deoxy- nucleotides to the 5' 
and/or 2' ends of the molecule or the use of 
phosphorothioate or 2' 0-methyl rather than phospho- 
diesterase linkages within the oligodeoxyribonucleotide 
backbone . 

5.9.1.2. MBGATIVB MQPniATORY AITPIBODY TECHNIQUES 

Antibodies can be generated which are both specific 
for target gene product and which reduce target gene product 
activity. Such antibodies may, therefore, by administered 
in instances whereby negative modulatory techniques are 
appropriate for the treatment of tumor progression. 
Antibodies can be generated using standard techniques 
described in Section 5.6, above, against the proteins 
themselves or against peptides corresponding to portions of 
the proteins. The antibodies include but are not limited to 
polyclonal, monoclonal. Fab fragments, single chain 
antibodies, chimeric antibodies, and the like. 

In instances where the target gene protein to which 
the antibody is directed is intracellular and whole 



antibodies are used, internalizing antibodies can be 
pr f rr d. However, lipofectin or liposomes can be used to 
deliver the antibody or a fragment of the Fab region which 
binds to the target gene epitope into cells. Where 
5 fragments of the antibody are used, the smallest inhibitory 
fragment which binds to the target protein's binding domain 
is preferred. For example, peptides having an amino acid 
sequence corresponding to the domain of the variable region 
of the antibody that binds to the target gene protein can be 
10 used, such peptides can be synthesized chemically or 

produced via recombinant DNA technology using methods well 
known in the art (e^, see Creighton, 1983, supra; and 
SambrooJc et al. . 1989, supra). Alternatively, single chain 
neutralizing antibodies which bind to intracellular target 
15 gene product epitopes can also be administered. Such 

single chain antibodies can be administered, for example, by 
expressing nucleotide sequences encoding single-chain 
antibodies within the target cell population by utilizing, 
for example, techniques such as those described in Marasco 
20 et al. (Marasco, W. et al., 1993, Proc. Natl. Acad. Sci. USA 

90:7889-7893) . 

In instances where the target gene protein is 
extracellular, or is a transmembrane protein, any of the 
administration techniques described, below in Section 5.10 
25 which are appropriate for peptide administration can be 
utilized to effectively administer inhibitory target gene 
antibodies to their site of action. 

POSITTVT! MQDIIIAT npy TBGHNIOUE8 
AS discussed above, successful treatment of tumor 
30 progression symptoms and of disorders involving tumor 

progression can be brought about by techniques which serve 
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to increase the level of target gene expression or to 
increase the activity of a target gene product. 

For example, compounds such as those identified 
through assays described, above, in Section 5.8, which 
exhibit positive modulatory activity can be used in 
accordance with the invention to ameliorate tumor 
progression symptoms. As discussed in Section 5.8, above, 
such molecules can include, but are not limited to, 
peptides, phosphopeptides, small organic or inorganic 
molecules, or antibodies (including, for example, 
polyclonal, monoclonal, humanized, anti-idiotypic, chimeric 
or single chain antibodies, and FAb, F(ab')2 and FAb 
expression library fragments, and epitope-binding fragments 
thereof) . Positive modulatory techniques involving antibody 
administration are described, below, in Section 5.9.2.1. 

For example, a target gene protein, at a level 
sufficient to ameliorate tumor progression symptoms can be 
administered to a patient exhibiting such symptoms. Any of 
the techniques discussed, below, in Section 5.10, can be 
utilized for such administration. One of skill in the art 
will readily know how to determine the concentration of 
effective, non-toxic doses of the normal target gene 
protein, utilizing techniques such as those described, 

below, in Section 5.10.1. 

In instances wherein the compound to be administered 
is a peptide compound, DNA sequences encoding the peptide 
compound can, alternatively, be directly administered to a 
patient exhibiting tumor progression symptoms, at a 
concentration sufficient to generate the production of an 
amount of target gene product adequate to ameliorate tumor 
progression symptoms. Any of the techniques described, 
below, in Section 5.10, which achieve intracellular 
administration, can be utilized for the administration of 



such DNA molecules. The DNA molecules can be produced, for 
example, by well-known recombinant techniques. 

In the case of peptide compounds which act 
extracellular ly, the DNA molecules encoding such peptides 
can be taken up and expressed by any cell type, so long as a 
sufficient circulating concentration of peptide results for 
the elicitation of a reduction in tumor progression 
symptoms . 

In the case of compounds which act intracellular ly, 
the DNA molecules encoding such peptides must be taken up 
and expressed by cells involved in the tumor progression at 
a sufficient level to bring about the reduction of tumor 
progression symptoms. 

Any technique which serves to selectively administer 
DNA molecules to a cell involved in tumor progression is, 
therefore, preferred for the DNA molecules encoding 
intracellular ly acting peptides. 

Further, patients can be treated for symptoms of 
tumor progression by gene replacement therapy. One or more 
copies of a normal target gene or a portion of the gene that 
directs the production of a normal target gene protein with 
target gene function can be inserted into cells, using 
vectors which include, but are not limited to adenovirus, 
adeno-associated virus, and retrovirus vectors, in addition 
to other particles that introduce DNA into cells, such as 
liposomes. Techniques such as those described above can be 
utilized for the introduction of normal target gene 
sequences into human cells. 

In instances wherein the target gene encodes an 
extracellular, secreted gene product, such gene replacement 
techniques may be accomplished either in vivo or in vitro. 
For such cases, the cell types expressing the target gene is 
less important than achieving a sufficient circulating 
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concentration of th extracellular molecules for the 
am lioration of tumor progression symptoms to occur. In 
vitro, targ t gen sequences can be introduced into 
autologous cells. Those cells expressing the target gene 
sequence of interest can then be reintroduced, preferably by 
intravenous administration, into the patient such that there 
results an amelioration of tumor progression symptoms. 

in instances wherein the gene replacement involves a 
gene which encodes a product which acts intracellular ly, it 
is preferred that gene replacement be accomplished m vivo. 
Further, because the cell type in which the gene replacement 
n^ust occur is the cell type involved in tumor progression, 
such techniques must successfully target such tumor 
progression cells. 

Taking the 030 gene as an example, an increase in 
030 expression can serve to ameliorate tumor progression 
symptoms, such as, for example, tumor progression symptoms 
involving metastatic processes. Therefore, any positive 
modulatory described herein which increases the 030 gene 
product or gene product activity to a level which is 
sufficient to ameliorate tumor progression symptoms 
represents a successful tumor progression therapeutic 
treatment . 

5 9 3. METHODS FOR DBPLBTIMG CELLS 

Techniques described herein can be utilized to 
deplete the total number of cells involved in tumor 
progression, thus effectively decreasing the ratio of the 
tumor cells to non-cancerous cells. Specifically, 
separation techniques are described which can be used to 
deplete the total number of tumor cells present within a 
cell population, and, further, targeting techniques are 
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biotin-streptavidin separations using biotin conjugated to 
cell surface marker-specific antibodies and avidin or 
streptavidin bound to a solid support such as affinity 
column matrix or plastic surfaces or magnetic separations 
using antibody-coated magnetic beads. 

separation via antibodies for specific markers can 
be by negative or positive selection procedures. In 
negative separation, antibodies are used which are specific 
for markers present on undesired cells, in this case tumor 
cells, which exhibit, for example, the tumor specific cell 
surface marker. Cells bound by an antibody to such a cell 
surface marker can be removed or lysed and the remaining 
desired mixture retained. In positive separation, 
antibodies specific for markers present on the desired cells 
of interest, in this case tumor-like cells, are used. Cells 
bound by the antibody are separated and retained. It will 
be understood that positive and negative separations can be 
used substantially simultaneously or in a sequential manner. 

A common technique for antibody based separation is 
the use of flow cytometry such as by a florescence activated 
cell sorter (FACS) . Typically, separation by flow cytometry 
is performed as follows. The suspended mixture of cells are 
centrifuged and resuspended in media. Antibodies which are 
conjugated to fluorochrome are added to allow the binding of 
the antibodies to specific cell surface markers. The cell 
mixture is then washed by one or more centrifugation and 
resuspension steps. The mixture is run through a FACS which 
separates the cells based on different fluorescence 
characteristics. FACS systems are available in varying 
levels of performance and ability, including multi-color 
analysis. The facilitating cell can be identified by a 
characteristic profile of forward and side scatter which is 
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example, buffered saline, and reintroduced Into the patient 
via intravenous administration. 

cells to be expanded can be cultured, using standard 
procedures, in the presence of an appropriate expansion 
Lent Which induces proliferation of the purified normal 
cell subpopulation. Such an expansion agent can, for 
example, be any appropriate cytoKlne, antigen, or antibody. 

prior to being reintroduced into a patient, the 
purified normal cells can be modified by, for example, 
transformation «ith gene seguences encoding gene products of 
interest, such gene products should represent products 
Which enhance the activity of the purified normal cell 
subpopulation or, alternatively, represent products which 
repress the activity of one or more of the other normal cell 
sulpopulations. cell transformation and gene 
procedures are well Known to those of sKiU in the art, and 
can be as those described, above, in section 5.2. 

well-known targeting methods can, additionally, be 
utilized in instances wherein the goal is to deplete the 
number of cells belonging to a specific tumor cell 
subpopulation. ■ Such targeting methods can be - — " - 
vitro! and can Involve the introduction of targeting agents 
into a population of cells such that the target ng agents 
Selectively destroy a specific subset of the cells within 

ToPUlatlon. in Vivo administration J" 
he followed for such targeting agents are described, below. 

In section 5.10. . 

Targeting agents generally comprise, first, a 
targeting moiety which, in the current instance, causes the 
targeting agent to selectively associate with a specif c 
:Z ce!l Lbpopulation. The targeting agents .-era 

compri cond, a moiety capable of destroying a cell with 

which th. targeting agent has become associated. 
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Targeting moieties can include, but are not limited 
to antibodies directed to cell surface markers found 
specifically on the tumor cell subpopulation being targeted 
or, alternatively, to ligands, such as growth factors, which 
bind receptor-type molecules found exclusively on the 
targeted tumor cell subpopulation. 

Destructive moieties include any moiety capable of 
inactivating or destroying a cell to which the targeting 
agent has become bound. For example, a destructive moiety 
can include, but it Is not limited to cytotoxins or 
radioactive agents. Cytotoxins include, '"/-"P^^' 

fungus-, or bacteria-derived toxins, with deglycosylated 
;iLn^ ciam toxins being generally preferred due to their 
potency and lengthy half -lives. 

5.10. PHARMACEUTICAL PRSP^^"?;!^., 
* i^wn MBTHQD8 ^nurMTaTRATION 

The identified compounds that inhibit target gene 
expression, synthesis and/or activity can be administered to 
alatient at therapeutically effective doses to prevent 
treat or ameliorate tumor progression. A therapeutically 
effective dose refers to that amount of the compound 
sufficient to result in amelioration of symptoms of tumor 
progression. 

5.10.1. F^^^rT^^ PQS' 
Toxicity and therapeutic efficacy of such compounds 
can be determined by standard pharmaceutical procedures in 
cell cultures or experimental animals, ^, for 
the U>,„ (the dose lethal to 50, of the P°PU---> 
ED „ (the dose therapeutically effective in 50% of the 
, ropulatlon,. The dose ratio between toxic ana therapeutic 
Lfect, is the therapeutic Index and It can be expressed as 
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the ratio LDso/^'so- Compounds which exh.b.t large 
therapeutic indices are preferred. While compounds that 
ZiZt toxic Side effects can he used, care should be taKen 
to design a delivery system that targets such compound, to 
the site of affected tissue in order to minimize potential 
damage to uninfected cells and, thereby, reduce side 

effects. , 
The data obtained from the cell culture assays and 

animal studies can be used in formulating a range of dosage 
for use in humans. The dosage of such compounds lies 
preferably within a range of circulating concentrations that 
fnlde the ED,„ with little or no toxicity. The dosage can 
vary within this range depending upon the dosage form 
employed and the route of administration utilized. For any 
compound used In the method of the Invention, the 
therapeutically effective dose can be estimated initially 
from cell culture assays. A dose can be formulated in 
animal models to achieve a circulating plasma """" "f "h 
range that Includes the IC,„ (1^, the concentration of the 
test compound which achieves a half-maximal inhibition of 
symptoms, as determined in cell culture. Such information 
cl^ be used to more accurately determine useful doses in 
humans. Levels In plasma can be measured, for example, by 
high performance liquid chromatography. 

5.10.2. rTPunT.nTioUji *M> "SB 

Pharmaceutical compositions for use in accordance 
With the present invention can be formulated i" 
manner using one or more physiologically acceptable carriers 

or excipients. , 

Thus, the compounds and their physiologically 
acceptable salts and solvates can be formulated for 
irinistration by inhalation or insufflation (either through 
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th mouth or the nose) or oral, buocal, parent ral or rectal 

administration. , 

For oral administration, the pharmaceutical 
compositions can taXe the form of, for example, tablets or 
5 capsules prepared by conventional means with 

pharmaceutically acceptable excipients such as binding 
agents (e^, pregelatinised maize starch, 
polyvinylpyrrolidone or hydroxypropyl methylcellulose) ; 
fillers U=tose, microcrystalline cellulose or 

10 calcium hydrogen phosphate) ; lubricants magnesium 
stearate, talc or silica); dislntegrants (e^, potato 
starch or sodium starch glycolate) ; or wetting agents (a^, 
sodium lauryl sulphate) . The tablets can be coated by 
methods well Xnown in the art. Liguid 
15 administration can taKe the form of, for "^^''/'/f 
syrups or suspensions, or they can be presented as a dry 
product for constitution with water or other suitable 
vehicle before use. Such liquid preparations can be 
prepared by conventional means with pharmaceutically 
20 acceptable additives such as suspending agents (e^, 

sorbLl syrup, cellulose derivatives or hydrogenated edible 
fats), emulsifying agents (s^, lecithin or acacia), non- 
aqueous vehicles (s^, almond oil, oily esters, «thyl 
arcohol or fractionated vegetable oils) ; and 
25 B^thyl or propyl-p-hydroxybenzoate. or sorbic acid) . 

iiif^ieparations can also contain buffer salts, flavoring, 
coloring and sweetening agents as appropriate. 

preparations for oral administration can be suitably 
formulated to give controlled release of the active 
30 ccmpound^.^^ ^^^^^^ ^aministration the compositions can take 
the form of tablets or lozenges formulated in conventional 
manner . 
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For administration by Inhalation, the compounds tor 

„.„„ilna to th. present Invention are conveniently 
use according to th P ^ presentation from 

::er::erprc:sra ne.uii.er, With the use o. a suitable 

trionioi">- pressurized 
dioxide or other suitable gas. In the case oi P 

, .he dosage unit can be determined by providing a 
"r"L deUver a metered amount, capsules and cartridges 
::tg! in .or use in an inhaler or insu„lator can be 

7o^Jte. containing a powder mix of the compound and a 

suitable powder base such as lactose or starch. 

The compounds can be formulated tor parenteral 

administration by injection. ^. by bolus 

continuous intusion ^''^^J^^^^^V'^^^^^ . 

irir e — Hi^; .eservativ,. 

.rmulatory agents can 
rin:irtr totrnLtutiln with > sultabU vehicU, 

WA-FrtT-o use. 



sterile pyrogen-free water, before use. 
The compounds can also be formulated in rectal 
compositions such as suppositories or retention enemas 
12. containing conventional suppository bases such as 
cocoa butter or other glycerides. 

in addition to the formulations described 

^ also be formulated as a depot 

nreviously, the compounds can also oe lor 
previou y, formulations can be 

preparation. Such long y example, subcutaneous ly or 

administered by implantation (for 

....amuscularly) or by 

example, the compounds can be formulated wit 
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polyBerlc or hydrophobic materials (for example as an 
Lui:io„ in an acceptable oil, or ion exchan,. - " 

sparingly soluble derivatives, for exampl , as a sparingly 

;h?c'o.positions can, if desired, be presented in a 
V or dispenser device «hich can contain one or »ore unit 
To g ftr containing the active ingredient. The pac. can 
for example comprise metal or plastic foU, such as 
blister pacK. The pacK or dispenser device can be 
, accompanied by instructions for administration. 

5.11. r.r.n«nnTg or T'~"1^ yunnnFaaiOM 

A variety of methods can be employed for the 

diagnosis of tumor progression and ^^^^^^^'^^J^^^^r 
tumor progression, including metastatic diseases Such 

' rg-rtg::: z::^.:^^ tn 

5 " 1 and altibodies directed against differentially 

:^;;s:d and pathway gene peptides, as described, above, in 

Section 5.2.1 (peptides, and 5.2.3 '-'^"f 
n ^eciflcally, such reagents can be used, for example, tor 
° the ec io: of the presence of target gene - 

the detection of either over or under expression of target 

r'methods described herein can be performed, for 
V,., „*-ilizina pre-packaged diagnostic kits 

z^.:^^^ rres:::^" 

acid or anti-fingerprint gene antibody '^^^^ ^^'^^^^^^ 
herein, which can be conveniently used, a^, m clini 
settin;^. to diagnose patients exhibiting symptoms of 
30 metastatic diseases. 
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Any cell type or tissue, preferably T-cells, in 
which the fingerprint gene is expressed can be utilized in 
the diagnostics described below. 

5.11.1. DETECTION OP PINQERPRINT 
rsBWB WTTCI BTe ACIDS 

DNA or RNA from the cell type or tissue to be 
analyzed can easily be isolated using procedures which are 
well known to those in the art. Diagnostic procedures can 
also be performed "in situ" directly upon tissue sections 
(fixed and/or frozen) of patient tissue obtained from 
biopsies or resections, such that no nucleic acid 
purification is necessary. Nucleic acid reagents such as 
those described in Section 5.1 can be used as probes and/or 
primers for such in sity procedures (see, for example, 
Nuovo, G.J., 1992, PGR in ^ hybridization: Protocols and 
Applications, Raven Press, NY) . 

Fingerprint gene nucleotide sequences, either RNA or 
DNA, can, for example, be used in hybridization or 
amplification assays of biological samples to detect gene 
structures and expression associated with metastasis. Such 
assays can include, but are not limited to, Southern or 
Northern analyses, single stranded conformational 
polymorphism analyses, in ^ hybridization assays, and, 
polymerase chain reaction analyses. Such analyses can 
reveal both quantitative aspects of the expression pattern 
of the fingerprint gene, and qualitative aspects of the 
fingerprint gene expression and/or gene composition. That 
is, such techniques can include, for example, point 
mutations, insertions, deletions, chromosomal 
rearrangements, and/or activation or inactivation of gene 
expression. 
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Preferred diagnostic methods for the detection of 
fingerprint gene-specific nucleic acid molecules can involve 
for example, contacting and incubating nucleic acids, 
derived from the cell type or tissue being analyzed, with 
one or more labeled nucleic acid reagents as are described 
in section 5.1, under conditions favorable for the specific 
annealing of these reagents to their complementary sequences 
within the nucleic acid molecule or interest. Preferably, 
the lengths of these nucleic acid reagents are at least 15 
to 30 nucleotides. After incubation, all non-annealed 
nucleic acids are removed from the nucleic acid: f ingerprxnt 
RNA molecule hybrid. The presence of nucleic acids from the 
target tissue which have hybridized, if any such molecules 
exist, is then detected. Using such a detection scheme, the 
nucleic acid from the tissue or cell type of interest can be 
immobilized, for example, to a solid support such as a 
membrane, or a plastic surface such as that on a mxcrotxtre 
plate or polystyrene beads. In this case, after incubation, 
non-annealed, labeled fingerprint nucleic acid reagents of 
the type described in Section 5.1 are easily removed. 
Detection of the remaining, annealed, labeled nucleic acid 
reagents is accomplished using standard techniques well- 
known to those in the art. . ^ ^ . «^ 
Alternative diagnostic methods for the detection of 
fingerprint gene specific nucleic acid molecules can involve 
their amplification, e^, by PGR (the experimental 
embodiment set forth in Mullis, K.B., 1987, U.S. Patent No. 
4,683,202), ligase chain reaction (Barany, F., 1991, Proc. 
Natl. Acad. Sci. USA M: 189-193), self sustained -quence 
replication (Guatelli, J.C. 1990, Proc. Natl. Acad. 
sci. USA si: 1874-1878), transcriptional amplification system 
(Kwoh, D.Y 1989, proc. Natl. Acad. Sci. USA M:1173- 
1177), Q-Beta Replicase (Lizardi, P.M. etL^, 1988, 
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Bi /Technology 4:1197), or any other nucl ic acid 
amplification method, followed by the detection of the 
amplified molecul s using techniques well known to those of 
skill in the art. These detection schemes are especially 
useful for the detection of nucleic acid molecules if such 
molecules are present in very low numbers. 

in one embodiment of such a detection scheme, a cDNA 
molecule is obtained from an RNA molecule of interest (e^, 
by reverse transcription of the RNA molecule into cDNA) . 
cell types or tissues from which such RNA can be isolated 
include any tissue in which wild type fingerprint gene is 
known to be expressed. A sequence within the cDNA is then 
used as the template for a nucleic acid amplification 
reaction, such as a PGR amplification reaction, or the like. 
The nucleic acid reagents used as synthesis initiation 
reagents primers) in the reverse transcription and 

nucleic acid amplification steps of this method are chosen 
from among the fingerprint gene nucleic acid reagents 
described in Section 5.1. The preferred lengths of such 
nucleic acid reagents are at least 19-30 nucleotides. For 
detection of the amplified product, the nucleic acid 
amplification can be performed using radioactively or non- 
radioactively labeled nucleotides. Alternatively, enough 
amplified product can be made such that the product can be 
visualized by standard ethidium bromide staining or by 
utilizing any other suitable nucleic acid staining method. 

in addition to methods which focus primarily on the 
detection of one nucleic acid sequence, fingerprint 
profiles, as discussed in Section 5.3.4., can also be 
assessed in such detection schemes. Fingerprint profiles 
can be generated, for example, by utilizing a differential 
display procedure, as discussed above in 5.1.1.2, Northern 
analysis and/or RT-PCR. Any of the gene sequences 
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described, above, in Section 5.2.1 can be used as probes 
and/or PCR prim rs for the generation and corroboration of 
such fingerprint profiles. 

5.11.2. nBTEGTIOK Qg TARGg T GENE PEPTIDE8 
Antibodies directed against wild type or mutant 
fingerprint gene peptides, which are discussed, above, in 
Section 5.2.3, can also be used in tumor progression 
diagnostics and prognostics, as described, for example, 
herein. Such diagnostic methods, can be used to detect 
abnormalities in the level of fingerprint gene protein 
expression, or abnormalities in the structure and/or tissue, 
cellular, or subcellular location of fingerprinting gene 
protein. Structural differences can include, for example, 
differences in the size, electronegativity, or antigenicity 
of the mutant fingerprint gene protein relative to the 
normal fingerprint gene protein. 

Protein from the tissue or cell type to be analyzed 
can easily be isolated using techniques which are well known 
to those of skill in the art. The protein isolation methods 
employed herein can, for example, be such as those described 
in Harlow and Lane (Harlow, E. and Lane, D., 1988, 
"Antibodies: A Laboratory Manual," Cold Spring Harbor 
Laboratory Press, Cold Spring Harbor, New York), which is 
incorporated herein by reference in its entirety. 

Preferred diagnostic methods for the detection of 
wild type or mutant fingerprint gene peptide molecules can 
involve, for example, immunoassays wherein fingerprint gene 
peptides are detected by their interaction with an anti- 
fingerprint gene specific peptide antibody. 

For example, antibodies, or fragments of antibodies, 
such as those described, above, in Section 5.2.3, useful in 
the present invention can be used to quantitatively or 
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qualitatively detect the presence of wild type or mutant 
fingerprint gene peptides. This can be accomplished, for 
exampl , by immunofluorescence techniques employing a 
f luorescently labeled antibody (see below) coupled with 
light microscopic, flow cytometric, or fluorimetric 
detection. Such techniques are especially preferred if the 
fingerprint gene peptides are expressed on the cell surface. 

The antibodies (or fragments thereof) useful in the 
present invention can, additionally, be employed 
histologically, as in immunofluorescence or immunoelectron 
microscopy, for in situ detection of target gene peptides, 
in sitii detection can be accomplished by removing a 
histological specimen from a patient, and applying thereto a 
labeled antibody of the present invention. The antibody (or 
fragment) is preferably applied by overlaying the labeled 
antibody (or fragment) onto a biological sample. Through 
the use of such a procedure, it is possible to determine not 
only the presence of the fingerprint gene peptides, but also 
their distribution in the examined tissue. Using the 
present invention, those of ordinary skill will readily 
perceive that any of a wide variety of histological methods 
(such as staining procedures) can be modified in order to 
achieve such in situ detection. 

immunoassays for wild type or mutant fingerprint 
gene peptides typically comprise incubating a biological 
sample, such as a biological fluid, a tissue extract, 
freshly harvested cells, or cells which have been incubated 
in tissue culture, in the presence of a detectably labeled 
antibody capable of identifying fingerprint gene peptides, 
and detecting the bound antibody by any of a number of 
techniques well-known in the art. 

The biological sample can be brought in contact with 
and immobilized onto a solid phase support or carrier such 
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as nitrocellulose, or other solid support which is capable 
of immobilizing cells, cell particles or soluble proteins. 
The support can then be washed with suitable buffers 
followed by treatment with the detectably labeled 
5 fingerprint gene specific antibody. The solid phase support 
can then be washed with the buffer a second time to remove 
unbound antibody. The amount of bound label on solid 
support can then be detected by conventional means. 

By "solid phase support or carrier" is intended any 

10 support capable of binding an antigen or an antibody. Well- 
known supports or carriers include glass, polystyrene, 
polypropylene, polyethylene, dextran, nylon, amylases, 
natural and modified celluloses, polyacrylamides, gabbros, 
and magnetite. The nature of the carrier can be either 

15 soluble to some extent or insoluble for the purposes of the 
present invention. The support material can have virtually 
any possible structural configuration so long as the coupled 
molecule is capable of binding to an antigen or antibody. 
Thus, the support configuration can be spherical, as in a 

20 bead, or cylindrical, as in the inside surface of a test 

tube, or the external surface of a rod. Alternatively, the 
surface can be flat such as a sheet, test strip, etc. 
Preferred supports include polystyrene beads. Those skilled 
in the art will know many other suitable carriers for 

25 binding antibody or antigen, or will be able to ascertain 
the same by use of routine experimentation. 

The binding activity of a given lot of anti-wild 
type or mutant fingerprint gene peptide antibody can be 
determined according to well-known methods. Those skilled 

30 in the art will be able to determine operative and optimal 
assay conditions for each determination by employing routine 
experimentation • 
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* wAvs in which the fingerprint gene 
.eptiaeZr^rarX; can .e ae.ec«.X. .a^ua U 
Un^ n, t^e sa.e to an enzyme and use in an en.y»e 

(EIA) (Voller, A., "The Enzyme Linked 

immunosorbent Assay l n„,rterlv Publication, 

„,S) (Microblolcical p^,,^,. 

Walkersville, MD , Veil . ^^^^ 73:482-523 

.cm-p^^o fl978); Butler, J.£*w necii* 

31:507-520 (1978) , jMMUNOASSAy , CRC Press, 
(1981); Maggio, E. (ed,) , ENZYM ^^^^^^ 

iQftO* Ishikawa, E. §S-sAj.> v*"' ' 
Boca Raton, FL, 1980, isni enzyme which is 

IMMUNOASSAY, Kgaku Shoin, Tokyo, 1981) . The enzym 

IMMUNOASi>A , ^ith an appropriate 

,ound to "^^^^^^^^ r Lmogenic suhstrate, in such a 
substrate, preferably ^^.^^ 

xnanner as to produce a ^^^^^^^i^, f luorimetric 

detected, for example, by spectrop detectably 

TTn^vmAs which can be usea t^o 
or by visual means. ^"^V"^^ ..^^^.^ to, malate 

label the antibody include, but are . ....g.^teroid 
dehydrogenase, staphylococcal nuclease, delta 5 

^".s^a:: ^rrerr:::::; rsphate isomerase, 

glycerophosphate, deny ' ^^^e, asparaginase, 

horseradish peroxidase, a^ -^^^^^^^^^ ribonuclease, urease, 
glucose oxidase, "^^"^^^ dehydrogenase, glucoamylase 

catalase, gl-""-*-^''""^'''" "^^^^ Jio„ ean be accomplished 
and acetylcholinesterase The detecti 

1 ^««fT-ir methods which employ a cntwm ^ 
,y colorimetric ^ accomplished by 
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(see, for example, Weintraub, B., Principles of 
Radioimmunoassays, Seventh Training Course on Radioligand 
Assay Techniques, The Endocrine Society, March, 1986, which 
is incorporated by reference herein) . The radioactive 
isotope can be detected by such means as the use of a gamma 
counter or a scintillation counter or by autoradiography. 

It is also possible to label the antibody with a 
fluorescent compound. When the f luorescently labeled 
antibody is exposed to light of the proper wave length, its 
presence can then be detected due to fluorescence. Among 
the most commonly used fluorescent labeling compounds are 
fluorescein isothiocyanate, rhodamine, phycoerythrin, 
phycocyanin, allophycocyanin, fi-phthaldehyde and 

f luorescamine . 

The antibody can also be detectably labeled using 

fluorescence emitting metals such as ^"eu, or others of the 

lanthanide series. These metals can be attached to the 

antibody using such metal chelating groups as 

diethylenetriaminepentacetic acid (DTPA) or 

ethylenediaminetetraacetic acid (EDTA) . 

The antibody also can be detectably labeled by 
coupling it to a chemi luminescent compound. The presence of 
the chemiluminescent-tagged antibody is then determined by 
detecting the presence of luminescence that arises ^uring 
the course of a chemical reaction. Examples of particularly 
useful chemiluminescent labeling compounds are luminol, 
isoluminol, theromatic acridinium ester, imidazole, 
acridinium salt and oxalate ester. 

Likewise, a bioluminescent compound can be used to 
label the antibody of the present invention. 
Bioluminescence is a type of chemi luminescence found in 
biological systems in, which a catalytic protein increases 
the efficiency of the chemiluminescent reaction. The 
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presence of a bioluninescent protein is determined by 
detecting the presence of luminescence. Important 
bioluminescent compounds for purposes of labeling are 
luciferin, luciferase and aequorin. 

6. EXAMPLE I IDENTIPICATION AND 

CHARACTERIZATION OP A NOVEL GENE 
THAT INHTBTTfl TUM ^P PPOGRBSSION 

in the Example presented in this Section, the in 
vitro paradigm, described, above, in Section 5.1.1.1, was 
utilized to identify a gene, designated herein as the 030 
gene, which is differentially expressed in cells with a high 
metastatic potential relative to cells having a low 
metastatic potential. Specifically, the 030 gene is 
expressed in high metastatic potential cells at a rate which 
is many-fold lower than it is expressed in non-metastatic 
cells Thus, as discussed below, the 030 gene can encode a 
product important to a number of neoplastic processes, 
including, for example, the progression of a cell to a 
metastatic state, the aggressiveness of a cell's metastatic 
state, and the ability of a primary tumor cell to invade 
surrounding tissue. Given the differential 030 gene 
expression pattern revealed in this Section, the 030 gene 
product can represent a protein having tumor suppressor or 
inhibitor function. 

6.1. MllTBTtTALS > Mp MBTH0D8 
6.1.1. rWT.T. cnLTURg 
B16 Fl and B16 FIO melanoma cell lines were 
maintained in culture in Eagle's minimal essential medium 
(MEM) supplemented with 10% fetal calf serum. Cells were 
harvested from nonconfluent monolayers by a two minute 
treatment with 0.25% trypsin and 2mM EDTA. 
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For further characterization of in vivo activity, 
each cell line was injected into mice. Cells were washed 
two times in MEM, and the final cell suspension adjusted to 
5 X 10^ cells per ml in MEM. Two hundred microliters of 
this cell suspension (1 x 10^ cells) was injected i.v. into 
the lateral tail vein of C57BL/6J mice. After three weeks, 
the mice were sacrificed and their lungs autopsied. The 
number of pulmonary tumors was determined by counting 
surface nodules using a dissecting microscope. 

The differential expression of the 030 gene in B16 
Fl relative to B16 FIO cell lines was compared with the 
extent of pulmonary metastases which developed in 516 Fl- 
injected mice relative to B16 FlO-injected mice. 

6.1.2. niffarent ial Diaplav 

Differential mRNA display was carried out as 
described, above, in Section 5.1.1.2. Details of the 
differential display are given, below. 
Taolation 

RNA was isolated, using RNAzol, from nonconfluent 
monolayers of B16 Fl and B16 FIO cell lines. 

Isolated RNA was resuspended in DEPC HjO and 
quant itated by spectrophotometry at ODjgo- Approximately 
half of the RNA samples were then treated with DNAse I to 
remove contaminating chromosomal DNA. Each 50 m1 RNA sample 
(50 /ig), 5.7 Hi lOx PGR buffer (Perkin-Elmer/Cetus) and 1 /il 
RNAse inhibitor (40 units/Ml; Boehringer Mannheim, Germany) 
were mixed together. Two microliters of DNAse I (10 
units/Ml; Boehringer Mannheim) was added to the reaction 
which was incubated for 30 min. at 37»C. The total volume 
was brought to 200 Ml with DEPC H2O, extracted once with 
phenol/chloroform and precipitated by adding 20 ^1 3M NaOAc, 
pH 4.8, (DEPC-treated) , 500 /il absolute ETOH and incubated 
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for 1 hour on dry ice. The precipitat d sample was 
centrifuged for 15 min., and the pellet was washed with 70% 
ETOH. The sample was re-centrifuged, the remaining liquid 
was aspirated, and the pellet was resuspended in 50 nl HjO. 
The concentration of RNA was measured by reading the ODjgo- 
ytrat strand cDNA synthesis 

For each RNA sample, duplicate reverse transcription 
reactions were carried out in parallel. Four hundred ng RNA 
plus DEPC HjO in a total volume of 10 nl were added to 4 /il 
TjiCC 3' primer (10 mM; Operon) . The mixture was incubated 
at 70"»C for 5 min. to denature the RNA and then placed at 
room temperature. Twenty-six nl of reaction mix containing 
the following components was added to each denatured 
RNA/primer sample: 8 ^il 5x First Strand Buffer (Gibco/BRL, 
Gaithersburg, MD) , 4 /il O.IM DTT (Gibcol/BRL) , 2 /il RNAse 
inhibitor (40 units/MD (Boehringer Mannheim) , 4 Ml 200 iM 
dNTP mix, 6 Ml HjO, 2 Ml Superscript reverse transcriptase 
(200 units/Ml; Gibcol/BRL). The reactions were mixed gently 
and incubated for 30 min. at 42 »C. sixty m1 of HjO, for a 
final volume of 100 m1, was then added and the samples were 
denatured for 5 min. at 85»C and stored at -20 »C. 
PGR reaetiona 

The resulting single stranded cDNA molecules were 
then amplified by PGR. Specifically, 13 m1 of reaction mix 
was added to each tube of a 96 well plate on ice. The 
reaction mix contained 6.4 m1 H2O, 2 Ml lOx PGR Buffer 
(Perkin-Elmer) , 2 Ml 20 MM dNTPs, 0.4 Ml dATP (12.5 
MCi/Ml; 50 MCi total; Dupont/NEN) , 2 Ml 5' primer 0PE4 
(5'GTGACATGCC-3' ; 10 MM; Operon), and 0.2 Ml AmpliTaq" 
Polymerase (5 units/Ml; Perkin-Elmer). Next, 2 Ml of 3' 
primer (TuCC, 10 mM) were added to the side of each tube, 
followed by 5 m1 of cDNA, also to the sides of the tubes, 
which were still on ice. Tubes were capped and mixed, and 
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brought up to 1000 rpm in a centrifuge, then immediately 
returned to ice. A Perkin-Elmer 9600 thermal cycler was 
used, and programmed as follows: 



940c 
*94oC 
*40»C 
*ramp 72''C 
*72»C 
72*C 
40c 



2 min. 
15 sec. * = X 40 

2 min. 

1 min. 
30 sec. 

5 min. 
hold 



When the thermal cycler initially reached 94 »c, the 
96 well plate was removed from ice and placed directly into 
the cycler. Following the amplification reaction, 15 Ml of 
loading dye, containing 80% formamide, 10 mM EDTA, 1 mg/ml 
xylene cyanole, 1 mg/ml bromphenol blue were added. The 
loading dye and reaction were mixed, incubated at 85«»C for 5 
min., cooled on ice, centrifuged, and placed on ice. 
Approximately 4 lil from each tube was loaded onto a pre-run 
(60V) 6% denaturing acrylamide gel. The gel was run at 
approximately 80V until top dye front was about 1 inch from 
bottom. The gel was transferred to 3MM paper (Whatman 
Paper, England) and dried under vacuum. Bands were 
visualized by autoradiography. 

6.1.3. OTHER TECHNIQUES 
Amnlifiad ePMX BaB <^ laolation and Amplification 

PGR bands determined to be of interest in the 
differential display analysis were recovered from the gel 

and reamplified. 

Briefly, differentially expressed bands were excised 
from the dried gel with a razor blade and placed into a 
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microfuge tube with 100 fxl H2O and heated at 100**C for 5 
min., vortexed, heated again to lOO^C for 5 min. , and 
vortexed again. After cooling, 100 /il HjO, 20 m1 3M NaOAc, 
1 Ml glycogen (20 mg/ml) , and 500 /il ethanol were added and 
5 the sample was precipitated on dry ice. After 

centrifugation, the pellet was washed and resuspended in 10 
Ml H2O. 

DNA isolated from the excised differentially 
expressed bands were then reamplified by PGR using the 
10 following reaction conditions: 





58 




H2O 




10 


Ml 


lOx PCR Buffer (see above) 


0 


10 


Ml 


200 /iM dNTPs 




10 


Ml 


10 3' primer (see above) 




15 10 


Ml 


10 MM 5' primer (see above) 


il TS? 


1.5 


Ml 


amplified band 




0.5 


Ml 


AMPLITAQ® polymerase (5 units/Ml; 




(Perkin Elmer) 







PCR conditions were the same as the initial 
20 conditions used to generate the original amplified band, as 
described, above. After reamplif ication, glycerol loading 
dyes were added and samples were loaded onto a 2% 
preparative TAE/Biogel (BiolOl, La Jolla, CA) agarose gel 
and eluted. Bands were then excised from the gel with a 
25 razor blade and vortexed for 15 min. at r.t., and purified 
using the MERMAID™ kit from BiolOl by adding 3 volxames of 
MERMAID" high salt binding solution and 8 Ml of resuspended 
glassfog in a microfuge tube. Glassfog was then pelleted, 
washed 3 times with ethanol wash solution, and then DNA was 
30 eluted twice in 10 Ml at 50**C. 
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The n cloning kit (Invitrogen, San Diego, CA) was 
us d to subclone the aMplifled bands. The ligation reaction 
typloally consisted of 4 Ml sterile HjO, 1 ^1 ligation 
buffer, 2 Ml TA cloning vector, 2 .1 PCR product, and 1 .1 
T4 DNA ligase. The volume ot PCR product can vary, but the 
total volume of PCR product plus HjO was always 6 ,.1. 
Ligations (including vector alone) were incubated overnight 
at 12-C before bacterial transformation. TA cloning kit 
competent bacteria (INV.F' : SBMI. «S^. !lsdEll(r-k, m.k, , 

3^' =*^' *80lacZaAM154(lMZ}£itflI3E). 
deoR+, F') were thawed on ice and 2 ^l of 0.5 M 
mercaptoethanol were added to each tube. Two .1 
ligation were added to each tube ot competent cells (50 , 
mixed without vortexing, and incubated on ice for 30 min. 
Tubes were then placed in 42-C bath for exactly 30 sec 
before being returned to ice for 2 min. Four hundred-fifty 
„1 of see media (Sambrook sLiU, "89- saEM) were then 
added to each tube which were then shaken at 37-C for 1 hr. 
Bacteria were then pelleted, resuspended in approximately 
200 ,1 SOC and plated on Luria broth agar plates containing 
X-gal and 60 amplcllUn and Incubated overnight at 

37.C. White colonies were then picked and screened for 

inserts using PCR. 

A master mix containing 2 Ml 10 x PCR buffer, 1.6 fl 
2.5 mM dHTP's, 0.1 .1 25 mM MgCl,, 0.2 .1 H13 reverse primer 
(100 ng/.l), 0.2 Ml M13 forward primer (100 ng/Ml), 0 1 Ml 
AmpliTaq. (Perkin-Elmer), and 15.8 Ml H,0 was made Forty 
„1 of the master mix were allquoted into tubes of a 96 well 
plate, and whole bacteria were added with a pipette tip 
prior to PCR. The thermal cycler was programmed for insert 
screening as follows: 
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94*C 2 inin. 

*94oc 15 sec. * = X 35 

*47**C 2 min. 

*rainp 72<»C 30 sec. 

itj2^C ^0 sec. 

40c hold 

taction prcduct= were eluted on a 2% agarose gel 
and compared to vector control, colonies with -ctors 
containing Inserts were purified by streaKxng onto LB/Amp 
plates, vectors were isolated from such strains and 
subjected to sequence analysis, using an Applied B.osystems 
Automated sequencer (Applied Biosystems, Inc. Seattle, «A) . 

r lftntno o f W"m*n Gene 

^ human retina cDMA library obtained from Clontech 

was screened using the entire mouse fomy030 ^""^ <«=^:^* 
and 3B, as a probe, curing this screen, one million library 
phage were screened, 53 of which were found to hybridize 
lil the mouse tomy030 probe. The oDKA inserts for eight of 
these positives were isolated, subcloned, and sequenced. 

comparison of the murine fomy030 and human f ohy030 
sequences demonstrated a high degree of sequence 
<e« Identical at the nucleotide level and 94.4% identical 
at the amino add level) within the 5', 1813 base pairs of 
their CDNAS. However, beyond this point the sequences 
diverge and share no significant similarity. The sequence 
of fomy030 at the point of divergence is STAG, which 
corresponds to a consensus splice donor site. 

\ Th;«-taJependent library isolated cDNAs, as well as 

^ ji T7r~^Irp-B™i«ct_were found to contain 

a cDNA Isolated as a 3' RACE proaucusgr ,„.»,„. 

the fomy030 sequence. Thus, the most pro^ibIr-«*ianation 
for the divergence of the human and murine sequences T»^h* 
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15 



20 



existenc of-v^ternat splice forms of the fomy030 and 
fohy030 transcrt|rt».*The fomy030 splice version results in 
a protein product ofSAr-aminoacids in length, while the 
\ P ^fohy030 splice variant is predic 

1497 amino a cids in length(FIG^_^)_:^ — 

AnotherspIIce"vananrirs^ in FIG. 6 (SEQ ID 
NO:^), and^codes a protein of 1533 amino acids in length 
(SEQ ID N0:9)\The cDNA of FIG. 5 (SEQ ID N0:6) is missing 
34 nucleotides beginning after 2879 in SEQ ID NO: 8, and is 
missing 74 nucleosides beginning after 2926 in SEQ ID NO: 8. 
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Thus, nucleotides 2^80-2892 in SEQ ID NO: 6 are identical to 
nucleotides 2914-292\ in SEQ ID NO: 8, and the sequences are 
essentially identical\tarting at 2893 in SEQ ID NO: 6 and 
3001 in SEQ ID NO: 8. Tfke difference in the respective amino 
acid sequences is that thk amino acids are identical from 1 
to 844, and then again frois850 to 1497 in SEQ ID NO: 7 and 
from 886 to 1533 in SEQ ID NO^. 

Within their common 5' \equences, fohy030 was also 
found to have an additional threXbase pairs (GGA) inserted 
after position 1394 in the mouse cd^A (at positions 1066- 
1068 in FIGS. 5 and 6). These additiWl three base pairs 
fall within the open reading frames of\^h fohy030 and 
fomy030, and result in an additional GlycW residue at 
position 356 within the open reading frame k^fohy030 
relative to fomy030. 

Northern analysis was performed to confirm the 
differential expression of the genes corresponding to the 
amplified bands, as described below. 

Twelve micrograms of total RNA sample, 1.5 x RNA 
loading dyes (60% formamide, 9% formaldehyde, 1.5 x MOPS, 
.075% X C/BPB dyes) at a final concentration of 1 x and HjO 
to a final volume of 40 nl were mixed. The tubes were 
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heated at 65 for 5 min. and then cooled on ice. The RNA 
samples analyzed were loaded onto a denaturing 1% agarose 
gel. Th gel was run overnight ht 32V in 1 x MOPS buffer. 

A 300 ml denaturing 1% agarose gel was made as 
follows. Three grams of agarose (SeaKem" LE, FMC 
BioProducts, Rockland, ME) and 60 ml of 5 x MOPS buffer 
(O.IM MOPS [pH 7.0], 40 mM NaOAc, 5mM EDTA [pH 8.0]) were 
added to 210 ml sterile H2O. The mixture was heated until 
melted, then cooled to 50»C, at which time 5 nl ethidium 
bromide (5 mg/ml) and 30 ml of 37% formaldehyde were added 
to the melted gel mixture. The gel was swirled quickly to 
mix, and then poured immediately. 

After electrophoresis, the gel was photographed with 
a fluorescent ruler, then was washed three times in DEPC 
H2O, for 20 minutes per wash, at room temperature, with 
shaking. The RNA was then transferred from the gel to 
Hybond-N<» membrane (Amersham) , according to the methods of 
Sambrook et al. . 1989, supra, in 20 x SSC overnight. 

The probes used to detect mRNA were typically 
synthesized as follows: 2 /xl amplified cDNA band (-30 ng) , 
7 /il H2O, and 2 /il 10 X Hexanucleotide mix (Boehringer- 
Mannheim) were mixed and heated to 95<>C for 5 min., and then 
allowed to cool on ice. The volume of the amplified band 
can vary, but the total volume of the band plus HjO was 
always 9 /xL 3 /il dATP/dGTP/dTTP mix (1:1:1 of 0.5 mM 
each), 5 Ml a^^P <iCTP ^000 Ci/mM (50 /iCi total; Amersham, 
Arlington Heights, IL) , and 1 ^1 Klenow (2 units; 
Boehringer-Mannheim) were mixed and incubated at 37 »C. 
After 1 hr., 30 /il TE were added and the reaction was loaded 
onto a Biospin-6- column (Biorad, Hercules, CA) , and 
centrifuged. A 1 nl aliquot of eluate was used to measure 
incorporation in a scintillation counter with scintillant to 
ensure that 10^ cpm/^l of incorporation was achieved. 
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For pre-hybridization, the blot was placed into a 
roller bottle containing 10 ml of rapid-hyb solution 
(Amersham), and placed into 65»C incubator for at least 1 
hr. For hybridization, 1 x 10^ cpm of the probe was then 
heated to 95»C, chilled on ice, and added to 10 ml of rapid- 
hyb solution. The prehybridization solution was then 
replaced with probe solution and incubated for 16 hours at 
65<»C. The following day, the blot was washed once for 20 
min. at room temperature in 2 x SSC/0.1% SDS and twice for 
15 min. at 65«C in 0.1 x SSC/0.1% SDS before being covered 
in plastic wrap and put down for exposure. 
Tn Situ Hy bridization 

10 Mm sections of formalin fixed/ paraffin embedded 
benign nevi (non-metastic growths of melanocytes) and 
malignant melanoma were post-fixed with 4% PFA/PBS for 15 
minutes. After washing with PBS, sections were digested 
with 21 iig/nl proteinase K at 37 for 15 minutes, and again 
incubated with 4% PFA/PBS for 10 minutes. Sections were 
then washed with PBS, incubated with 0.2 N HCl for 10 
minutes, washed with PBS, incubated with 0.25% acetic 
anhydride/ 1 M triethanolamine for 10 minutes, washed with 
PBS, and dehydrated with 70% ethanol and 100% ethanol. 

Hybridizations were periformed with ^^g-radiolabeled 
(5 X lo"' cpm/ml) cRNA probes encoding a 1.1 kB segment of 
the coding region of the human cDNA (clone fohy030) , and a l 
kB segment of the coding region of the human H4 histone gene 
in the presence of 50% formamide, 10% dextran sulfate, 1 x 
Denhardt's solution, 600 mM NaCl, 10 mM DTT, 0.25% SDS, and 
100 fiq/ml tRNA for 18 hours at 55»C. The H4 histone gene 
was used as a control to show proper transcription of RNA. 

After hybridization, slides were washed with 5 x SSC 
at 55 'C, 50% formamide/ 2 x SSC at 55<»C for 30 minutes, 10 mM 
Tris-HCl(pH 7.6) /500 mM NaCl/1 mM EDTA (TNE) at 37«C for 10 
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minutes, washed in TNE at 37 'C for 10 minutes, incubated 
once in 2 X SSC at 50»C for 30 minutes, twice in 0.2 x SSC 
at 50»C for 30 minutes, and dehydrated with 70% ethanol and 
100% ethanol. Localization of mRNA transcripts was detected 
by dipping slides in Kodak NBT-2 photo-emulsion and exposing 
for 4 days at 4«C. Controls for the in situ hybridization 
experiments included the use of a sense probe which showed 
no signal above backgrounds levels. 

6.2. RESULTS 
An in vitro paradigm, as described, above, in 
Section 5.1.1.1, was carried out using the melanoma cell 
lines, B16 Fl and B16 FIO. The 316 Fl cell line exhibits a 
low metastatic potential, while the 316 FIO cell line 
exhibits a high metastatic potential. Thus, the two cell 
lines were grown in vitro as described in Section 6.1.1, RNA 
was isolated from these cells and differential display 
carried out as described in Section 6.1. 

The differential display analysis identified a band, 
designated romy030, which represents a cDNA derived from RNA 
produced by a gene which was expressed at a much higher 
level in the 316 Fl cells, i.e. . the low metastatic 
potential cells, relative to the gene's expression in 316 
FIO cells, 1^, high metastatic potential cells. The gene 
corresponding to the romy030 band is referred to herein as 
the fomy030 or 030 gene. 

The amplified romy030 band was isolated, 
reamplified, subcloned, and sequenced, as described, above, 
in section 6.1.3. The romy030 nucleotide sequence (SEQ ID 
N0:1) is shown in FIG. 2. 

A BLAST (Altschul, S.F. et al., 1990, J. Mol. Biol. 
21^:403-410) database search with the romy030 nucleotide 
sequence revealed no sequences within the database which are 
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similar to that of romy030. Thus, 030, the gene 
corresponding to romy030, appears to represent a novel, 
previously unknown gene which is differentially expressed in 
cells exhibiting a low metastatic potential relative to 
those cells exhibiting a high metastatic potential. 

TO confirm this putative differential regulation, 
amplified romy030 cDNA was used to probe Northern RNA blots 
containing RNA from B16 Fl and B16 FIO cells. FIG. 1 shows 
the results of one such Northern blot analysis, in which it 
is demonstrated that the steady state levels of fomy030 mRNA 
are significantly higher in the low metastatic potential 
cells (jL^, the B16 Fl cells) relative to the high 
metastatic potential cells (i^, 316 FIO cells) . Lanes 1 
and 3 represent Fl cells and Lanes 2 and 4 represent FIO 
cells respectively. Thus, this Northern analysis confirmed 
the putative differential fomy030 regulation which had been 
suggested by the differential display results. 

^ TWO specific oligonucleotides were generated based 
on the seqtience of romy030, romy030U 5'- 
GGGGAAGCACATb^GGAAC-3' (SEQ ID N0:4) and romy030L 5'- 
GCAACTACACTCGgWgC-3' (SEQ ID N0:5), for use in PGR 
reactions. cDNA\4^raries prepared from mRNA isolated from 
normal melanocytes ^ a mouse melanoma cell line were 
analyzed for the presd^^of fomy030 by PGR, utilizing the 
above romy030 probes. Fo^O was detected in the 
melanocyte library but not i^K^e melanoma library. The 
melanoma library was generated >^ a highly metastatic 
mouse melanoma K-1735 m2. This re^ is consistent the 
observation that fomy030 is present a^^duced levels in the 
metastatic B16 FIO melanoma cell line, ^ioactive DNA 
probe was generated from the subcloned romyoX^A. This 
probe was used to screen the normal mouse melan^sjj^te cDNA 
library. Three independent positive clones were idfentified 
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and isolated during this screening. These clones were 
d sifted fomy030a, fomy030b, and fomy030c. These cDNAs 
were s^^ti^ed and th overlapping portions were found to be 
identical/'^Ptl^ nucleotide sequence of all three fomy030 
cDNAs, designate^S^s the fomyOSO sequence (SEQ ID NO: 2) is 
depicted in FIGS. 3A^H:id3B, and contains the sequence of 
romy030. The findings d^^CJC^bed herein suggest a novel role 
for fomy030 in tumor progressiwt^-^ down-regulation of 030 
can be used as a diagnostic marker fbi?<tumor progression, 
especially for the progression to metasta^if. Further, 030 
gene products can be used in the prevention and-i^reatment of 
tumor progression disorders. 
yohyOSO P v pregaio n in Human Tlaaue Samples 

To determine whether the fohy030 gene product is 
differentially expressed in clinically relevant human 
disease, fohy030 gene expression was analyzed in biopsy 
sections of human benign nevi (non-metastic growths of 
melanocytes) and malignant melanoma using in situ 
hybridization. Fohy030 expression was detected in small 
intermittent cells in the basal layer of the epidermis 
(likely, melanocytes) and in the majority of nevus cells in 
patients diagnosed with benign nevi. No expression of 
fohy030 was detected in the majority of melanoma cells in 
patients diagnosed with metastatic melanoma, though 
expression was detected in normal melanocytic cells in the 
same tissue section. These results show that the fohy030 
gene product is associated with metastasis suppression. 

6.3. 030 GENB EXPRESSION IB INVERSELY 

CQRRELATPP WITH ME T^aT^TIC POTENTIAL 

6.3.1. gCTBRIMENTAI. PROTOCQ T.H ^MP RESULTS 

The relationship between 030 gene expression and 
tumor progression was confirmed as described herein. 
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specifically, the metastatic potentials of six variants of 
th B16 c 11 line wer tested in animals and the metastatic 
potential was compared to the level of 030 gene expression 
observed within the cell variants. 

A single cell suspension of B16 Fl cells (low 
metastatic potential) was injected intravenously into 
syngeneic C57BL/6 mice. After three weeks, lung tumors were 
excised and seeded into tissue culture. The following six 
cell lines were grown in culture: B16 Gl, B16 G2, B16 G3, 
B16 G4, B16 G9 and B16 G12. 

To test the metastatic ability of the above listed 
six tumor cell lines, the same number of cells for each of 
the six cell lines intravenously into different groups of 
syngeneic C57BL/6 mice. Three weeks later, the mice were 
killed and the lungs were removed aseptically. 
Significantly more number of tumors were observed in mice 
injected with the following three cell lines: B16 G4, B16 
G9 and B16 G12. These results demonstrate that the B16 G4, 
B16 G9 and B16 G12 cell lines have high metastatic potential 
and the B16 Gl, B16 G2 and B16 G3 cell lines have low 
metastatic potential. 

The lung tumors produced from these three highly 
metastatic cell lines (B16 G4, B16 G9 and B16 G12) were then 
excised and seeded into tissue culture to produce the 
following four cell lines: B16 H5, B16,H6, B16 H7 and B16 
H8. 

Northern analysis was performed to determine the 
expression of 030 gene in the above listed cell lines (i^, 
B16 H5, B16,H6, B16 H7 and B16 H8) using procedures 
described above in Section 6.1.3. Figure 4 shows the 
results of one such Northern blot analysis, in which it is 
demonstrated that the steady state levels of 030 mRNA are 
significantly lower in the highly metastatic cells (; 
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B16 H5. B16,H6, 316 H7 and B16 H8) relative to the B16 Fl 
low metastatic potential cells. Lane 1 represents the B16 
Fl cells, lane 2 is B16 FIO metastatic cells and lanes 3-6 
represent B16 H5, B16,H6, B16 H7 and B16 H8. 

Thus, this Northern analysis confirmed the initial 
finding in this invention that 030 expression is inversely 
related to the metastatic potential of tumor cells and 
supports the theory that the 030 gene product plays a role 
in inhibiting tumor progression, including the progression 
to a high metastatic potential state. In this regard, it is 
important to note that the tumor cell number and 
homogeneity, and the syngeneic recipient did not change from 
one cell line to another in the above protocols. Therefore, 
the differences in metastatic incidence can only be 
attributed to properties intrinsic to the various cell lines 
used. The clonal selection of tumors from successive 
metastases results in cells better capable of survival, 
formation and progression of tumor foci in the lung. This 
indicates that the decrease in expression of 030 observed m 
the highly metastatic four cell lines (i^, B16 H5, B16,H6, 
B16 H7 and B16 H8) is an intrinsic property of these cell 
lines and is related to the development, progression and 
metastatic potential of the tumor cells. 

7. example: USB OP PIMGBRPRIMT GENES 

The expression pattern of the fingerprint genes of 
the invention may be utilized as surrogate markers to 
monitor clinical human trials of drugs being tested for 
their efficacy as tumor progression treatments, or may, 
additionally, be used to monitor patients undergoing 
clinical evaluation for the treatment of tumor progression. 
"Fingerprint gene," as used herein is defined as in Section 
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3, abov . Individual fingerprint gene expression patt rns 
may b analyzed or, alternatively, fingerprint patterns may 
be analyzed. "Fingerprint pattern," as used herein is 
defined as in Section 3, above. 
5 The effect of the compound on the fingerprint gene 

expression normally displayed in connection with a disorder 
involving tumor progression can be used to evaluate the 
efficacy of the compound as a treatment for such a disorder. 
Additionally, fingerprint gene expression can be used to 
10 monitor patients undergoing clinical evaluation for the 
treatment of the disorder. 

According to the invention, the fingerprint gene 
^ expression and fingerprint pattern derived from any of the 

S paradigms described in Section 5.1.1.1 can be used to 

2 15 monitor clinical trials of drugs in human patients. The 
ll paradigms described in Section 5.1.1.1, and illustrated in 

P the Example presented in Section 6, above, for example, 

ti provide the fingerprint pattern of B16 melanoma cells. This 

profile gives an indicative reading, therefor, of the 
20 metastatic and non-metastatic states of melanoma cells, 
ti Accordingly, the influence of anticancer chemotherapeutic 

M agents on the melanoma cells can be measured by performing 

^ differential display on melanoma cells of patients 

undergoing clinical tests. 

25 7.1. TREATMENT OP PATIENTS AND PROCUREMENT 

OF TUMOR CELLS OR BIOPSIES 

Test patients can be administered compounds 
suspected of antimetastatic activity. Control patients can 
be given a placebo. 
30 Tumor cells or biopsies can be drawn from each 

patient after a determined period of treatment and RNA can 
be isolated as described in Section 6.6.1, above. 
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7.2. ANALYSIS OF SAMPLES 

RNA can be subjected to differential display 
analysis as described in Section 6.6.1, above. A deer ase 
in the metastatic potential of tumor cells is indicated by 
5 an increase in the intensity of the romy030 band, as 
described in Section 6.2, above. 

8. DEPOSIT OP MICROORGANISMS 

The following microorganism was deposited with the 
Agricultural Research Service Culture Collection (NRRL) , 
10 Peoria, Illinois, on March 3, 1995 and assigned the 
indicated accession number: 

Mierooraanlsm NRRL Acc eaaion No. 

E. coli B-21416 




OTHER EMBODIMENTS 

15 The present invention is not to be limited in scope 

by the specific embodiments described which are intended as 
single illustrations of individual aspects of the invention 
and functionally equivalent methods and components are 
within the scope of the invention, in addition to those 

20 shown and described herein will become apparent to those 
skilled in the art from the foregoing description and 
accompanying drawings. Such modifications are intended to 
fall within the scope of the appended claims. 



174591. Bll 
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SEQUENCE LISTING 



(1) GENERJiLL INFORMATIONS 

(1) Ai>PLICANTt Shy j an, Andrew W. 

(ii) TI^^ OP INVENTION: COMPOSITIONS AND METHODS FOR THE 

DIAGNOSIS, PREVENTION AND TREATMENT OF TUMOR 
PROGRESSION 

(iii) NUMBE'k OF SEQUENCES: 9 

(iv) CORRESP<)NDENCE ADDRESS: 

(A) ADDRESSEE: Fish & Richardson P.C. 

(B) STR^T: 225 Franklin Street 

(C) cityK Boston 

(D) STATE i 

(E) COUNTRY: USA 

(F) ZIP: 02\l0-2804 

(V) COMPUTER READABLE FORM: 

(A) MEDIUM TYP\: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS /MS-DOS 

(D) SOFTWARE: Pat^ntln Release #1.0, Version #1.30 

(vi) CURRENT APPLICATION \aTA: 

(A) APPLICATION NUMB&R: US 

(B) FILING DATE: 29-M>^-1996 

(C) CLASSIFICATION: 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: U^ 08/412,431 

(B) FILING DATE: 29-MAR-19^5 

(viii) ATTORNEY/AGENT INFORMATION: 
(A) NAME: Fasse, J. Peter 

REGISTRATION NUMBER: 32,9^ 
REFERENCE/DOCKET NUMBER: 07^34/004001 

(ix)XSJ^ECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 617/542-5070 

(B) TELEFAX: 617/542-8906 

(C) TELEX: 200154 




INFORMATION FOR SEQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 186 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:1: 
GGTGCTGGAG TACCTCATGG GCGGTGCCTA CCOCTGCAAC TACACTCGGA AAAOdtTTCCG 60 
GACTCTCTAC AACAACTTGT TTGGCCCTAA GAGGGTAGAG CTCAGCAGAC ACACAGTdJC 120 
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CTGTGCCTCC CAGAGTS^CA TGTGGTTCCT TGATGTGCTT CCCCAAAAGC CCACCTGTGC 
AGAATG 

(2) INFORMATION POr\sBQ ID NO:2i 

(i) SEQUENCE CHARACTERISTICS! 

(A) LENGTH:\2729 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDWESS: single 

(D) TOPOLOGYi linear 

(xi) SEQUENCE DESCRIPTION t SEQ ID NO: 2: 
AAGGAGGCTA GGCTGCACCC TtVcCGCTTG CTCAGCAGCT GAGGCAGGGT CAGAAAGCAT 
GGATAGAGAA GACATTTTGC AAAAgGGAAT GCATCTTTGT AATTCCCAGT ACAAAAGACC 
CTAACAGATG TTGCTGTGGT CAgAcACTA ACCACCACAT CCCCCCTTTG CCGAGTGGGG 
CTCCCAGCAC AACAGGAGAG GACActAAGC AGGCAGACAC GCAGTCCGGG AAATGGTCTG 

tcagcaaaca cacccagagc tacccjSacag actcctatgg gattcttgaa ttccagggtg 

'C CAATAAAGCC ATG TAd ATC CGA GTC TCC TAC GAC ACC AAG 
Met Tyr\lle Arg Val Ser Tyr Aap Thr Lys 
\ ^ 10 

CCA/gAT TCC CTG CTC CAC CTC AtA GTG AAG GAC TGG CAG CTG GAG CTC 
Pr,/ Asp ser Leu Leu His Leu Met\val Lys Asp Trp Gin lIS gS "5 
/ 15 \ 20 25 



180 
186 



ceo AAG CTC TTG ATA TCT GTG CAC dGA GGC CTC CAA AGO TTC GAG ATG 
Pro Lys Leu Leu He Ser Val His cly Gly Leu Gin Ser Phe Glu Met 

A5 40 

CAG TCC AAA CTG AAG CAG GTG TTT Goi AAA GOT CTG ATC AAG GOT GCC 
Gin ser Lys Leu Lys Gin Val Phe G^Lys Gly Leu ill i^fs aS 

50 y 55 

ATG ACC ACC GGG GCO TGG ATC TTC ACC 4GG GOT GTG AGC ACT GOT GTC 
Met Thr Thr Gly Ala Trp lie Phe Thr ^ Gly Val Sev vlt 

GTC AGC CAT GTG GGG GAT GCC TTG AAA GAC CAC TCC TCC AAG TCC AGA 
val ser His Val Gly Asp Ala Leu Lys aS His Ser Ser i^s Ser 

80 *\ 85 5^ 

GGC CGG CTC TGT GOT ATA GGA ATT GCT CCc\tGG GGC ATG GTG GAG AAC 
Gly Arg Leu Cy. Ala lie Gly He Ala Proftrp Gly J2 5ai gK JJS 
95 100 V 105 

AAG GAA GAC CTG ATT GGA AAA GAT GTA ACA aW GTC TAT CAG ACC ATG 
Lys Glu ASP Leu He Gly Lys Asp Val Thr fig val Tyr 

115 \ 120 

TCC AAC CCT CTG AGC AAG CTC TCT GTG CTC AAc\aAT TCC CAC ACT CAC 
ser Aan Pro Leu Ser Lys Leu Ser Val Leu Asn Van Ser Hia Thr Hie 



60 
120 
180 
240 
300 
350 

398 

446 

494 

542 

590 

638 



686 



734 
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125 



130 



135 



TTC ATC TTG GCT GAC 
Ph He L u Ala Asp 
140 



\C GGC ACC CTG GGC AAG TAT GGT GCT GAG GTG 
3n Gly Thr Leu Gly Lys Tyr Gly Ala Glu Val 
145 



150 



AAG CTT CGA AGA CAG CTCJyGAA AAA CAC ATC TCC CTG CAG AAG ATC AAC 
Lys Leu Arg Arg Gin LeuXGlu Lys His He Ser Leu Gin Lys He Asn 
155 160 \ 165 170 

ACA AGG CTG GGC CAG GGT GTA CCT GTC GTG GGC CTA GTG GTA GAA GGT 
Thr Arg Leu Gly Gin Gly vAl Pro Val Val Gly Leu Val Val Glu Gly 
175 \ 180 185 

GGT CCT AAC GTG GTT TCT ATQ GTC CTG GAG TAT CTC AAA GAA GAC CCT 
Gly Pro Asn Val Val Ser Ile\val Leu Glu Tyr Leu Lys Glu Asp Pro 
190 \ 195 200 




CCT GTC CCT GTG GTG GTT TGC 
Pro Val Pro Val Val Val Cys 
205 

TTG TCC TTC GCA CAC AAA TAC 
Leii--Ser Phe Ala His Lys Tyr 

225 



GGC AGT GGA CGT GCC TCT GAC ATT 
pp Gly Ser Gly Arg Ala Ser Asp He 

215 

GAC GAA GGA GGA GTC ATA AAC GAG 
^Asp Glu Gly Gly Val He Asn Glu 
230 



TCC/CTG CGG GAC CAG CTT CTA GTT 
Ser Leu Arg Asp Gin Leu Leu Val 

240 



ATT CAG AAA ACA TTT AAT TAC 
ir He Gin Lys Thr Phe Asn Tyr 
245 250 



AGC AAG TCC CAG TCG TAT CAG CTG TT 
Ser Lys Ser Gin Ser Tyr Gin Leu Phe^ 
255 

AAG AAG AAA GAA CTC GTC ACT GTG TTT 
Lys Lys Lys Glu Leu Val Thr Val Phe 
270 275 



GCA ATT ATC ATG GAG TGC ATG 
Ala He He Met Glu Cys Met 

!60 265 

ATG GGT TCC GAG GGT CAG 
:g Met Gly Ser Glu Gly Gin 
280 



CAA GAT GTC GAG ATG GCA ATT TTA ACT GcV 
Gin Asp Val Glu Met Ala He Leu Thr AU 
285 290 

GCA TCA GCT CCA GAT CAG CTG AGC TTG GCC 
Ala Ser Ala ?ro Asp Gin Leu Ser Leu Ala 
300 305 



TTG CTC AAA GGA ACC AAC 
Leu Leu Lys Gly Thr Asn 
295 

^TG GCT TGG AAC CGG GTC 
3u Ala Trp Asn Arg Val 
310 



GAC ATA GCG CGA AGC CAG ATC TTC GTC TTT GGOL CCA CAC TGG CCG CCA 
Asp He Ala Arg Ser Gin He Phe Val Phe GlyXPro His Trp Pro Pro 
315 320 325 \ 330 



CTG GGA AGC CTG GCC CCT CCT GTG GAC ACC AAA 
Leu Gly Ser Leu Ala Pro Pro Val Asp Thr Lys 
335 340 

AAG AAG CCA CCC ACA GCC ACC ACC AAG GGG AGA GGA^ 
Lys Lys Pro Pro Thr Ala Thr Thr Lys Gly Arg Gly 
350 355 



GCA GAG AAG GAA 
Ala Glu Lys Glu 
345 

. GGA AAA GGC 
[*ys Gly Lys Gly 
360 



782 



830 



878 



926 



974 



1022 



1070 



1118 



1166 



1214 



1262 



1310 



1358 



1406 
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AAO AA0 AAA GGC AAA 
Lys Lys Lys Gly Lys 

365 



AAA GAG GAA GTG GAG GAA GAG ACG GAC CCC 
Lys Glu Glu Val Glu Glu Glu Thr Asp Pro 
370 375 




CGG AAG CTT GAG CTG CTC )\AC TGG GTG AAT GCC CTG GAG CAA GCC ATG 
Arg Lys Leu Glu Leu Leu nan Trp Val Aen Ala Leu Glu Gin Ala Met 
380 395 390 

CTG GAT GCT CTT GTC CTA GA* CGG GTG GAC TTT GTA AAG CTC CTG ATT 
Leu Asp Ala Leu Val Leu Asa Arg Val Asp Phe Val Lys Leu Leu lie 
395 400 \ 405 410 

GAA AAC GGA GTG AAC ATG CAG CAT TTC CTC ACC ATC CCG AGG CTG GAG 
Glu Asn Gly Val Aan Met Gin His Phe Leu Thr lie Pro Arg Leu Glu 
415 \ 420 425 

GAG CTA TAC AAC ACC AGA CTG GGC CCA CCA AAC ACC CTT CAT CTG CTG 
Glu Leu Tyr Asn Thr Arg Leu Gly Pro Pro Asn Thr Leu His Leu Leu 
430 \ 435 440 

GTG CGG GAT GTA AAG AAG AGC AAC CTT CCA CCT GAT TAC CAC ATC AGC 
Val Arg Asp Val Lys Lys Ser AsA Leu Pro Pro Asp Tyr His lie Ser 
45 450\ 455 

GAT ATA GGA CTG GTG CTG ^AG TAC CTC ATG GGC GGT GCC TAC 
Asp lie Gly Leu Val Leu Glu Tyr Leu Met Gly Gly Ala Tyr 
465 \ 470 

AAC TAC ACT CGG AAA AGC TTC CGG ACT CTC TAC AAC AAC TTG 
Asn Tyr Thr Arg Lys Ser Phb Arg Thr Leu Tyr Asn Asn Leu 
480 \ 485 490 



CCT AAG AGG GTA GAG CTC AGCl 
Pro Lys Arg Val Glu Leu Ser 
495 



AGA CAC ACA GTG TCC TGT GCC 
g His Thr Val Ser Cys Ala 
00 505 



AGT AAC ATG TGG TTC CTT GAT G\ 
Ser Asn Met Trp Phe Leu Asp Va 
510 515 



CTT CCC CAA AAG CCC ACC 
Leu Pro Gin Lys Pro Thr 
520 



TGT GCA GAA TGC AAC TCT TCA CCT CAC CTGXTCC CAA ACT GAC ATC ACC 
Cys Ala Glu Cys Asn Ser Ser Pro His Leu^er Gin Thr Asp lie Thr 
525 530 \ 535 



CCA CCT CTG CCC T GACACCCAGT GCAGGGCCTC 
Pro Pro Leu Pro 
540 



CtAGCTTTCA CATGCAGCCA 



TTCACATCGC CTCTCAAGAC TGGGCCAGGC AGTGCAACCT \GTCAAGCATG TCTGTCCTCC 
CCTCCTTCCT ACAATAGCCC CCCCTCTGGG CCCCATGCCT CijrGCTCTCTC AGCCCGTTCT 
CCTCCCCACT GATCACTGGC GCTCCTGTTG TCTTCCAAGG C^iAGGAACAA GGAAAAGCAT 
CTTTTTGCCC ACAAAAGTTT AGGGCTCCCC GCTGTTCAAC CATAGCCAAC CTCACTGTAC 
ATCGGAGTCA TCCAGGCCAG CTGCCACACA CAAGCCTTCC CCACCCTATC CCAATAGACC 



1454 

1502 

1550 

1598 

1646 

1694 

1742 

1790 

1838 

1886 

1934 

1987 

2047 
2107 
2167 
2227 
2287 
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CTATTCCTCC 


ATCAAAATCA 


AAGCTAACTC 


CTGGCCTGCC 


ACATTGCTTC 


TTCTTGCTCC 


2347 


A6CCTGTTAA 


ACCTCCAATA 


AATGTCAGAT 


CTGTGGGAAG 


CCTTCCTCAC 


TCTCACTCCA 


2407 


CAGTTTGTAC 


AGAGAGCGAG 


AGCCTCGTTT 


GGTTCTACTT 


ACAAGGAAGG 


CTTTGTGTCT 


2467 


GTCTGTCCTT 


CCCAACTGAC^ 


TTCTGTTGAC 


AGAAGCAGTT 


TCCACATGAA 


AGCGTTGACT 


2527 


CACCTGGAT6 


TTGTCATTAA 


TTAATAGTGA 


TACAAAATAT 


TGACACTTCT 




Z 30 / 


CTTTGTTATG 


CAGCCGAAAG 


CACTTAAGCT 


TCTGGGAATG 


GAAGTAAGTA 


GGACATGTTT 


2647 


GTGGCAGTTT 


ATTTACTATA 


TATACCTTTG 


TCATTCTGTG 


GAAGCAAAAA 


TTGCAATGTT 


2707 


TTCCATGAAT 


AAAGCTCGTG 


CC \ 








2729 




Val Phe 
50 



(2) INFORMATION FOR SEQ 10^0:3: 

(i) SEQUENCE CHARACTBpISTICS: 

(A) LENGTH: 542\amino acids 

(B) TYPE: amino \ac id 
(D) TOPOLOGY: lihear 

xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

r lie Arg Val Ser Tyr AspXThr Lys Pro Asp Ser Leu Leu Hia 
5 \ 10 15 

Val Lys Asp Trp Gin Leu g\u Leu Pro Lys Leu Leu lie Ser 
20 is 30 

Gly Gly Leu Gin Ser Phe GlA Met Gin Ser Lys Leu Lys Gin 
35 40 \ 45 

Gly Lys Gly Leu He Lys Ala Xla Met Thr Thr Gly Ala Trp 
55 \ 60 

He Phe Thr Gly Gly Val Ser Thr Gly VaU Val Ser His Val Gly Asp 
65 70 \ 75 80 

Ala Leu Lys Asp His Ser Ser Lys Ser Arg\Gly Arg Leu Cys Ala He 

85 90 \ 95 

Gly He Ala Pro Trp Gly Met Val Glu Asn lVs Glu Asp Leu He Gly 
100 105 A 110 

Lys Asp Val Thr Arg Val Tyr Gin Thr Met SeA Asn Pro Leu Ser Lys 
115 120 \ 125 

Leu Ser Val Leu Asn Asn Ser His Thr His Phe \Ib Leu Ala Asp Asn 
130 135 V4Q 

Gly Thr Leu Gly Lys Tyr Gly Ala Glu Val Lys Leu Arg Arg Gin Leu 
145 150 155 \ 160 

Glu Lys His He Ser Leu Gin Lys He Asn Thr Arg\Leu Gly Gin Gly 
165 170 \ 175 
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val Pr val Val Gly LeL Val Val Glu Gly Gly Pro Asn Val Val Ser 
180 \ 185 190 

II Val Leu Olu Tyr LeA Lys Glu Asp Pro Pro Val Pro Val Val Val 

195 \ 200 205 

Cys Asp Gly Ser Gly Arg Ser Asp He Leu Ser Phe Ala His Lys 



U5 



220 



Tyr cys Asp Glu Gly Gly V^l He Asn Glu Ser Leu Arg Asp Gin Leu 



230 



235 



240 



Leu Val Thr He Gin Lys Thjr Phe Asn Tyr Ser Lys Ser Gin Ser Tvr 

245 \ 250 255 

Gin Leu Phe Ala He He Met\Glu Cys Met Lys Lys Lys Glu Leu Val 
260 \ 265 270 

Thr Val Phe Arg Met Gly Ser blu Gly Gin Gin Asp Val Glu Met Ala 
275 iaO 285 

290 "^^^ "^^^ ^"^^ 



295 



300 



Lew 
}05 



Led 



Ala Leu Ala Trp Ash Arg Val Asp He Ala Arg Ser Gin 
310 \ 315 320 

He Phe >^al Phe Gly Pro His Trp\pro Pro Leu Gly Ser Leu Ala Pro 

325 \ 330 335 

\ Pfb !/al Asp Thr Lys Ala Ala Glu ^s Glu Lys Lys Pro Pro Thr Ala 
" 345 350 

r Thr Lys Gly Arg Gly Lys Gly Lyb Gly Lys Lys Lys Gly Lys Val 

•»55 360 \ 365 

Lys Glu Glu val Glu Glu Glu Thr A8p\pro Arg Lys Leu Glu Leu Leu 



375 



380 



Asn Trp Val Asn Ala Leu Glu Gin Ala iLt Leu Asp Ala Leu Val Leu 

390 T 395 

Asp Arg Val Asp Phe Val Lys Leu Leu IlW Glu Asn Gly Val Asn Met 
405 4i6 

Gin His Phe Leu Thr He Pro Arg Leu G1u\g1u Leu Tyr Asn Thr Arg 



425 



430 



Leu Gly Pro Pro Asn Thr Leu His Leu Leu ^al Arg Asp Val Lys Lys 



440 



445 



Ser Asn Leu Pro Pro Asp Tyr His He Ser Le\ He Asp He Gly Leu 



455 



460 



Val Leu Glu Tyr Leu Met Gly Gly Ala Tyr Arg\:ys Asn Tyr Thr Arg 



470 



475 \ 480 
Lys Ser Phe Arg Thr Leu Tyr Asn Asn Leu Phe Gly Pro Lys Arg Val 
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485 



490 



495 




Glu Leu ser Arb His Thr VaX Ser Cys Ala Sor Gin Ser Aan Met Trp 
500 505 510 

Phe Leu Asp Val\Leu Pro Gin Lys Pro Thr Cys Ala Glu Cys Aan Ser 
515 \ 520 525 

Ser Pro His Leu ^r Gin Thr Asp He Thr Pro Pro Leu Pro 
530 \ 535 540 

(2) INFORMATION ?oi{ SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTHS 20 base pairs 

(B) TYPE: bucleic acid 

(C) STRANDMNESS: single 

(D) TOPOLOGY: linear 

(if/l) SEQUENCE DESCJ^IPTION: SEQ ID NO: 4: 
GCAC ATCAAGGAAC 
) INFORMATION FOR SEQ >JD NO: 5: 

(i) SEQUENCE CHARACTfeRISTICSi 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleib acid 

(C) STRANDEDNESS\ single 

(D) TOPOLOGY: linbar 



20 



(xi) SEQUENCE DESCRIPTION\j SEQ ID NO: 5: 
GCAACTACTA CACTCGGAAA AGC 
(2) INFORMATION FOR SEQ ID NO: 

(i) SEQUENCE CHARACTERISTIC^: 

(A) LENGTH: 4944 base j^airs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: singlj 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



23 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 346.. 4837 



(xi) SEQUENCE DESCRIPTION: SEQ ID 
ACTCATTATA GGGNTCGAGC GGCCGCCCGG GCAGGTTtGA GCTGTGCCCT CTCCATTCCA 
CTGCTGTGGC AGGGTCAGAA ATCTTGGATA GAGAAAACCT TTTGCAAACG GGAATGTATC 



60 
120 



" 136 



TTTGTAATTC CTAGCAC6AA AGACTCTAAC AGGTGTTGCT GTGGCCAGTT CACCAACCAG 
GATATCCCCC CTCTGCCAAG ^GCAACACCC AGCAAAAATG AAGAGGAAAG CAAACAGGTG 
GAGACTCAGC CTGAGAAATG GtCTGTTGCC AAGCACACCC AGAGCTACCC AACAGATTCC 



TATGGAGTTC TTGAATTCCA 



5TGGCGGA TATTCCAATA AAGCC ATG TAT ATC 

Met Tyr He 
1 



CGT GTA TCC TAT GAC ACC Aito CCA GAC TCA CTG CTC CAT CTC ATG GTG 
Arg Val Ser Tyr Asp Thr Lyp Pro Asp Ser Leu Leu His Leu Met Val 
5 IP 15 




AAA GAT TGG CAG CTG GAA CTC 
Lye Asp Trp Gin Leu Glu Let 
20 25 

GGC CTC CAG AAC TTT GAG ATG 
Gly Leu Gin Asn Phe Glu Met 
40 

AAA GGC/CTG ATC AAG GCT GCT 
Lys Gly Leu lie Lys Ala Ala 
55 



CCC AAG CTC TTA ATA TCT GTG CAT GGA 
Pro Lys Leu Leu lie Ser Val His Gly 
30 35 

[CAG CCC AAG CTG AAA CAA GTC TTT GGG 
In Pro Lys Leu Lys Gin Val Phe Gly 
45 50 

G ACC ACC GGG GCC TGG ATC TTC ACC 
M^t Thr Thr Gly Ala Trp He Phe Thr 
60 65 



GGT GTC AGC ACA GGT GTT ATCt AGC CAC GTA GGG GAT GCC TTG AAA 
Gly Val Ser Thr Gly Val Ile\ser His Val Gly Asp Ala Leu Lys 
70 75\ 80 



CGG GTT TGT GCT ATA GGA ATT GCT 
g Val Cys Ala He Gly He Ala 
95 

GAC CTG GTT GGA AAG GAT GTA 
Asp Leu Val Gly Lys Asp Val 
110 115 



GAC CAC TCC TCC AAG TCC AGA GGC 
Asp His Ser Ser Lys Ser Arg Gly 
85 90 

CCA TGG GGC ATC GTG GAG AAT AAG 
Pro Trp Gly He Val .Glu Asn Lys 
100 105 



ACA AGA GTG TAC CAG ACC ATG TCC AAA CCT CTA AGT AAG CTC TCT GTG 
Thr Arg Val Tyr Gin Thr Met Ser Asn\pro Leu Ser Lys Leu Ser Val 
120 \l25 130 



CTC AAC AAC TCC CAC ACC CAC TTC ATC 
Leu Asn Asn Ser His Thr His Phe He 
135 140 



?TG GCT GAC AAT GGC ACC CTG 
»u Ala Asp Asn Gly Thr Leu 
145 



GGC AAG TAT GGC GCC GAG GTG AAG CTG 
Gly Lys Tyr Gly Ala Glu Val Lys Leu Ari 
150 155 



AGC CTG CTG GAA AAG CAC 
Arg Leu Leu Glu Lys His 
160 



ATC TCC CTC CAG AAG ATC AAC ACA AGA CTG \ GGG CAG GGC GTG CCC CTC 
He Ser Leu Gin Lys He Asn Thr Arg LeulGly Gin Gly Val Pro Leu 
165 170 I 175 



GTG GGT CTC GTG GTG GAG GGG GGC CCT AAC 
Val Gly Leu Val Val Glu Gly Gly Pro Asn 
180 185 



STG GTG TCC ATC GTC TTG 
fal Val Ser He Val Leu 
190 195 



180 
240 
300 
354 

402 

450 

498 

546 

594 

642 

690 

738 

786 

834 

882 

930 
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GAA TAC CTG CAA GAA Gl 
Glu Tyr Leu Gin Glu Gil 
200 



CCT CCC ATC CCT GTG GTG ATT TGT GAT GGC 
Pro Pro He Pro Val Val He Cys Asp Gly 
205 210 



AGC GGA CGT GCC TCG GAC\ ATC CTG TCC TTT GCG CAC AAG TAC TGT GAA 
Ser Gly Arg Ala Ser Aspllle Leu Ser Phe Ala Hia Lys Tyr Cys Glu 
215 \ 220 225 



GAA GGC GGA ATA ATA AAT 
Glu Gly Gly He He Asn 
230 



\G TCC CTC AGG GAG CAG CTT CTA GTT ACC 
Lu Ser Leu Arg Glu Gin Leu Leu Val Thr 
235 240 



978 



1026 



1074 



ATT CAG AAA ACA TTT AAT TAti 
He Gin Lys Thr Phe Asn Tyi 
245 25C 



AAT AAG GCA CAA TCA CAT CAG CTG TTT 
Asn Lys Ala Gin Ser His Gin Leu Phe 
255 



1122 



GCA ATT ATA ATG GAG TGC ATGI AAG AAG AAA GAA CTC GTC ACT GTG TTC 
Ala He He Met Glu Cys Met\Lys Lys Lys Glu Leu Val Thr Val Phe 
260 265 \ 270 275 



AGA ATG GGT TCT GAG GGC CAG 
Arg Met Gly Ser Glu Gly Gin 
280 




:AG GAC ATC GAG ATG GCA ATT TTA ACT 
[In Asp He Glu Met Ala He Leu Thr 
285 290 



CTG AAA GGA ACA AAC GTA TCT GCT CCA GAT CAG CTG AGC TTG 
Leu Lys Gly Thr Asn vil Ser Ala Pro Asp Gin Leu Ser Leu 
295 \ 300 305 



340 



GCT TGG AAC CGC GTG GA< 
Ala Trp Asn Arg Val As 
310 315 

CCC CAC TGG ACG CCC CTG 
Pro His Trp Thr Pro Leu 
330 

GCC ACG GAG AAG GAG AAG 
Ala Thr Glu Lys Glu Lys 
345 



ATA GCA CGA AGC CAG ATC TTT GTC 
He Ala Arg Ser Gin He Phe Val 
320 

3A AGC CTG GCA CCC CCG ACG GAC 
fly Ser Leu Ala Pro Pro Thr Asp 
^ 335 

CCA CCC ATG GCC ACC ACC AAG 
Pro Pro Met Ala Thr Thr Lys 
350 355 



GGA GGA AGA GGA AAA GGG AAA GGC AAG> 
Gly Gly Arg Gly Lys Gly Lys Gly Lys 
360 

GAA GTG GAG GAA GAA ACT GAC CCC CGG 
Glu Val Glu Glu Glu Thr Asp Pro Arg 
375 380 



AAG 



AAA GGG AAA GTG AAA GAG 
ys Lys Gly Lys Val Lys Glu 
65 370 

VG ATA GAG CTG CTG AAC TGG 
rs He Glu Leu Leu Asn Trp 
385 



GTG AAT GCT TTG GAG CAA GCG ATG CTA GA 
Val Asn Ala Leu Glu Gin Ala Met Leu Aspl 
390 395 

GTC GAC TTT GTG AAG CTC CTG ATT GAA AAC 
Val Asp Phe Val Lys Leu Leu He Glu Asn 
405 410 



GCT TTA GTC TTA GAT CGT 
Ala Leu Val Leu Asp Arg 
400 

3A GTG AAC ATG CAA CAC 
Ly Val Asn Met Gin His 
415 



1170 



1218 



1266 



1314 



1362 



1410 



1458 



1506 



1554 



1602 



TTT CTG ACC ATT CCG AGG CTG GAG GAG CTT TAT AAC ACA AGA CTG GGT 
Phe Leu Thr He Pro Arg Leu Glu Glu Leu Tyi Asn Thr Arg Leu Gly 



1650 
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420 



430 



435 



CCA CCA AAC ACA CTT CA1 
Pro Pro Asn Thr Leu Hi£ 
440 



CTG CTG GTG AGO GAT GTG AAA AAG AGC AAC 
Leu Leu Val Arg Asp Val Lys Lys Ser Aan 
445 450 



1698 



CTT CCG CCT GAT TAC CAC 
Leu Pro Pro Asp Tyr Hia 
455 



^TC AGC CTC ATA GAC ATC GGG CTC GTG CTG 
[le Ser Leu lie Asp lie Gly Leu Val Leu 
460 465 



1746 



GAG TAC CTC ATG GGA GGA G^C TAC CGC TGC AAC TAC ACT CGG AAA AAC 
Glu Tyr Leu Met Gly Gly Al|a Tyr Arg Cya Asn Tyr Thr Arg Lya Aan 
470 \ 475 480 



1794 



TTT CGG ACC CTT TAC AAC AA' 
Phe Arg Thr Leu Tyr Asn As 
485 490] 



TTG TTT GGA CCA AAG AGG CCT AAA GCT 
Leu Phe Gly Pro Lya Arg Pro Lya Ala 
495 



1842 




CTT AAA CTT CTG GGA ATG GAA1 
Leu Lya Leu Leu Gly Met Glu 
500 505 



AAA AAA AAG AAA AAG 
Lya Lya Lya Lya Lya 
520 



GAT GAT GAG CCT CCA GCT AAA GGG AAG 
ap Aap Glu Pro Pro Ala Lya Gly Lya 
510 515 

Jag GAA GAG ATC GAC ATT GAT GTG GAC 
Lu Glu Glu lie Aap lie Asp Val Aap 
525 530 



GCC GTG AGT CGG TTC 
Ala Val Ser Arg Phe Gl^ 
535 



TAT CCC TTC CAC GAG CTG ATG GTG 
Tyr Pro Phe Hia Glu Leu Met Val 
540 545 



GTG CTG ATG AAA CGC CAGVAAA ATG GCA GTG TTC CTC TGG CAG 
Val Leu Met Lya Arg GlnXLys Met Ala Val Phe Leu Trp Gin 
550 555 \ 560 



GAA GAG AGC ATG GCC AAG 
Glu Glu Ser Met Ala Lya 
565 570 



ice CTG GTG GCC TGC AAG CTC TAC 
a Leu Val Ala Cya Lya Leu Tyr 
575 



AAG GCC ATG GCC CAC GAG TCC TCC GAA 
Lya Ala Met Ala Hia Glu Ser Ser Gli 
580 585 



AGT GAT CTG GTG GAT GAC ATC 
Ser Asp Leu Val Aap Aap lie 
590 595 



1890 



1938 



1986 



2034 



2082 



2130 



TCC CAG GAC TTG GAT AAC AAT TCC AAA 
Ser Gin Asp Leu Aap Aan Aan Ser Lya 
600 



gAC TTC GGC CAG CTT GCT TTG 
Phe Gly Gin Leu Ala Leu 
)5 610 



2178 



GAG TTA TTA GAC CAG TCC TAT AAG CAT GAt 
Glu Leu Leu Aap Gin Ser Tyr Lys Hia Asj 
615 620 



GAG CAG ATC GCT ATG AAA 
Glu Gin lie Ala Met Lya 
625 



2226 



CTC CTG ACC TAC GAG CTG AAA AAC TGG AGC 
Leu Leu Thr Tyr Glu Leu Lya Aan Trp Ser 
630 635 



^C TCX5 ACC TGC CTC AAA 
jLsn Ser Thr Cys Leu Lya 
640 



2274 



CTG GCC GTG GCA GCC AAA CAC CGG GAC TTC A% 
Leu Ala Val Ala Ala Lys His Arg Aap Phe IJ 
645 650 



GCT CAC ACC TGC AGC 
Ala Hia Thr Cya Ser 
655 



2322 
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CAO ATO CTO CTO AC 
Gin Met Leu Leu Th 
660 

AAC CCC GGC CTG AAG 
Asn Pro Gly Leu Lye 
680 

TTG TTT TTG GAA TTT 
Leu Phe Leu Glu Phe 
695 



GAT ATG TGG ATG GGA AGA CTG CGG ATG CGG AAG 
Aap Met Trp Met Gly Arg Leu Arg Met Arg Lys 
665 670 675 

ATC ATG GGC ATT CTT CTA CCC CCC ACC ATC 
/al He Met Gly He Leu Leu Pro Pro Thr He 
685 690 

ACA TAT GAT GAT TTC TCG TAT CAA ACA TCC 
Thr Tyr Asp Asp Phe Ser Tyr Gin Thr Ser 
700 705 



AAG GAA AAC GAG GAT GGc\ AAA GAA AAA GAA GAG GAA AAT ACG GAT GCA 
Lys Glu Asn Glu Asp Gly\Lys Glu Lys Glu Glu Glu Asn Thr Asp Ala 
710 \ 715 720 



AAT GCA GAT GCT GGC TCA 
Asn Ala Asp Ala Gly Ser 
725 



3A AAG GGG GAT GAG GAG AAC GAG CAT AAA 
Lys Gly Asp Glu Glu Asn Glu His Lys 
735 



AAA CAG AGA ATT ATC CCC ATC\GGA ACA AAA ATC TGT AAA TTC TAT AAC 
Lys Gin Arg He He Pro IleXGly Thr Lys He Cys Lys Phe Tyr Asn 
740 745 \ 750 755 




lTT GTC AAG TTC TGG 
He Val Lys Phe Trp 
760 



TAC ACA ATA TCA TAC TTG GGC TAC 
Tyr Thr He Ser Tyr Leu Gly Tyr 
765 770 



CTG TTT AAC TAC GTC ATC 
Leu Phe Asn Tyr Val lU 
775 

CAG GAG TGG ATC GTC ATC 
Gin Glu Trp He Val He 
790 795 



ATA CGA GAG ATC CTC ATG Ti 
He Arg Glu He Leu Met Si 
810 



CTG GTG CGG ATG GAT GGC TGG CCG 
Leu Val Arg Met Asp Gly Trp Pro 
i780 785 



rCC TAC ATC GTG AGC CTG GCG TTA 
Ber Tyr He Val Ser Leu Ala Leu 
800 

GAA CCA GGC AAA CTC AGC CAG 
Glu Pro Gly Lys Leu Ser Gin 
815 



AAA ATC AAA GTT TGG CTT CAG GAG TAc\ TGG AAC ATC ACA GAT CTC GTG 
Lys He Lys Val Trp Leu Gin Glu Tyr\Trp Asn He Thr Asp Leu Val 
820 825 \ 830 835 



GCC ATT TCC ACA TTC ATG ATT GGA GCA 
Ala He Ser Thr Phe Met He Gly Ala 

840 Bl 

ATG ATT GGA AAG ATG ATG ATC GAC ATG 
Met He Gly Lys Met Met He Asp Met Lei 
855 860 



GCC ACG AGA TCT GTG ATG 
*t Ala Thr Arg Ser Val Met 

,5 850 

TAC TTT GTG GTC ATC ATG 
Tyr Phe Val Val He Met 
865 



CTG GTC GTG CTC ATG AGT TTC GGA GTA GCc\cGT CAA GCC ATT CTG CAT 
Leu Val Val Leu Met Ser Phe Gly Val Ala\Arg Gin Ala He Leu His 
870 875 \ 880 



CCA GAG GAG AAG CCC TCT TGG AAA CTG GCC 
Pro Glu Glu Lys Pro Ser Trp Lys Leu Ala 



3A AAC ATC TTC TAC ATG 
rg Asn He Phe Tyr Met 



2370 



2418 



2466 



2514 



2562 



2610 



2658 



2706 



2754 



2802 



2850 



2898 



2946 



2994 



3042 
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885 



890 



895 



CCC TAG TGG ATO ATC 
Pro Tyr Trp M t lie 
900 

TAG GCC ATG GAA ATT AA 
Tyr Ala Met Glu He 
920 



\T GGA GAG GTG TTT GCA GAC CAG ATA GAG CTC 
^r Gly Glu Val Phe Ala Asp Gin He Aap Leu 
« 910 915 



Asn 



GAG GGC AAG CGG CTT OCT 
Glu Gly Lye Arg Leu Pro 
935 



OCT COT TGT GGT GAG AAG CTA TAT GAT GAG 
.Pro Pro Cys Gly Glu Asn Leu Tyr Asp Glu 
^ 925 930 

^CC TGT ATC CCC GGC GCC TGG CTC ACT CCA 
ro Cys He Pro Gly Ala Trp Leu Thr Pro 
940 945 



GCA CTC ATG GCG TGC TAT 
Ala Leu Met Ala Cys Tyr Lei 
950 



CTG GTC GCC AAC ATC CTG CTG GTG AAC 
Leu Val Ala Asn He Leu Leu Val Asn 
955 960 




CTG CTG ATT GCT GTG TTC AAc\aAT ACC TTC TTT GAA GTA AAA TCA ATA 
Leu Leu He Ala Val Phe Asn y^sn Thr Phe Phe Glu Val Lys Ser He 
965 970 \ 975 



'CAG GTG TGG AAG TTC 
Gin Val Trp Lys Phe 
985 

AGG CCA GTC CTG CCC CC 
Arg Pro Val Leu Pro Pre 
1000 



hG CGA TAT CAG CTG ATT ATG ACA TTT 
In Arg Tyr Gin Leu He Met Thr Phe 



990 



995 



CCG ATG ATC ATT TTA AGG CAC ATC 
Pro Met He He Leu Ser His He 
1005 1010 



ATC ATT ATG CGT CTC AGC toGC CGC TGC AGG AAA AAG AGA GAA 
He He Met Arg Leu Ser ply Arg Cys Arg Lys Lys Arg Glu 
1015 l020 1025 



CAA GAG GAA CGG GAT CGT 
Gin Glu Glu Arg Asp Arg 
1030 1035 



GAG GAG CTA AAG AGG CTG CAT GAG 
Glu Glu Leu Lys Arg Leu His Glu 
1045 1050 



3A TTG AAG CTC TTC CTT AGC GAC 
Ly Leu Lys Leu Phe Leu Ser Asp 
1040 

GAG GAG CAG TGC GTG CAG GAG 
Glu Glu Gin Cys Val Gin Glu 
1055 



CAC TTC CGG GAG AAG GAG GAT GAG CAg\cAG TCG TCC AGC GAC GAG CGC 
His Phe Arg Glu Lys Glu Asp Glu GlnVSln Ser Ser Ser Asp Glu Aro 
1060 1065 \ 1070 1075 

ATC CGG GTC ACT TCT GAA AGA GTT GAA KAT ATG TCA ATG AGG TTG GAA 
He Arg Val Thr Ser Glu Arg Val Glu Afin Met Ser Met Arg Leu Glu 
1080 l(\85 1090 

ACT TCC CTG CAG ACT GTT 
Thr Ser Leu Gin Thr Val 
1105 

^CT AAC AGA ATG GTG AAT 
pr Asn Arg Met Val Asn 
1120 



GAA ATC AAT GAA AGA GAA ACT TTT ATG 
Glu He Asn Glu Arg Glu Thr Phe Met Lys^ 
1095 1100 

GAC CTT CGA CTT GCT CAG CTA GAA GAA TTA 
Asp Leu Arg Leu Ala Gin Leu Glu Glu Leu 
1110 1115 



3090 



3138 



3186 



3234 



3282 



3330 



3378 



3426 



3474 



3522 



3570 



3618 



3666 



3714 
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GCT CTT GAA AAT CTT GCG 
Ala Leu Glu Asn Leu Ala Glj 

1125 ii: 

CGG TCC CGG GCT TCT TCT C 
Arg s r Arg Ala Ser Ser C 
1140 1145 



ATC GAC AGG TCT GAC CTG ATC CAG GCA 
lie Asp Arg Ser Asp Leu He Gin Ala 
10 1135 

TGT GAG GCA ACG TAT CTT CTC CGG CAA 
Cys Glu Ala Thr Tyr Leu Leu Arg Gin 
1150 1155 



AGC AGC ATC AAT AGC GCT GAT \gGC TAC AGC TTG TAT CGA TAT CAT TTT 
Ser Ser He Aan Ser Ala Asp bly Tyr Ser Leu Tyr Arg Tyr His Phe 
1160 1 1165 1170 



AAC GGA GAA GAG TTA TTA TTT 
Asn Gly Glu Glu Leu Leu Phe 
1175 



SAG GAT ACA TCT CTC TCC ACG TCA CCA 
ilu Asp Thr Ser Leu Ser Thr Ser Pro 
1180 1185 



GCG ACA GGA GTC AGG AAA AAA Afcc TGT TCC TTC CGT ATA AAG GAA GAG 
Gly Thr Gly Val Arg Lys Lys T&r Cys Ser Phe Arg He Lys Glu Glu 
1190 1195 1200 




AAG GAC GTG AAA ACG CAC CTA GTC 
Lys Asp Val Lys Thr His Leu Va} 
1205 1210 



CCA GAA TGT CAG AAC AGT CTT CAC 
Pro Glu Cys Gin Asn Ser Leu His 
1215 



GGC ACA AGC ACA TCA \ GCA ACC CCA GAT GGC AGT CAC CTT 
Gly Thr Ser Thr Ser yvia Thr Pro Asp Gly Ser His Leu 
1225 \ 1230 1235 



GCAyfeTA GAT GAC TTA AAG AAC GCT 
Val Asp Asp Leu Lys Asn Ala G\) 
1240 

ATT GGG ATT TCA AAG GAA GAT GAT G; 
He Gly He Ser Lys Glu Asp Asp Gil 
1255 12i 

GAA GAA ACT ATT TCC "CCA AGT TTA AAT^ 
Glu Glu Thr He Ser Pro Ser Leu Asn 
1270 1275 

CAG GAC AAA TCA GAT GTT CAA AAC ACT 
Gin Asp Lys Ser Asp Val Gin Asn Thr 
1285 1290 



AAT ATA GAA GGC ACT ATT TCC TAT CCC CT€ 
Asn He Glu Gly Thr He Ser Tyr Pro Let 
1300 1305 



GAG TCA AAA TTA GGT CCA GAT 
Glu Ser Lys Leu Gly Pro Asp 
1245 1250 

AGA CAG ACA GAC TCT AAA AAA 
Arg Gin Thr Asp Ser Lye Lys 
1265 



AAA 



ACA GAT GTG ATA CAT GGA 
.ys Thr Asp Val He His Gly 
1280 



\G CTA ACA GTG GAA ACG ACA 
|Ln Leu Thr Val Glu Thr Thr 
1295 

GAA GAA ACC AAA ATT ACA 
Glu Glu Thr Lys He Thr 
1310 1315 



CGC TAT TTC CCC GAT GAA ACG ATC AAT GCt\tGT AAA ACA ATC AAG TCC 
Arg Tyr Phe Pro Asp Glu Thr He Asn Alalcys Lys Thr Met Lys Ser 
1320 1325\ 1330 



AGA AGC TTC GTC TAT TCC CGG GGA AGA AAG 
Arg Ser Phe Val Tyr Ser Arg Gly Arg Lys 
1335 1340 

CAG GAT GTA GAG TAC AGT TCA ATC ACG GAC 
Gin Asp Val Glu Tyr iSer Ser He Thr Asp 



CTG GTC GGT GGG GTT AAC 
ieu Val Gly Gly Val Asn 
1345 

\G CAA TTG ACG ACG GAA 
Ln Gin Leu Thr Thr Glu 



3762 



3810 



3858 



3906 



3954 



4002 



4050 



4098 



4146 



4194 



4242 



4290 



4338 



4386 



4434 
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1350 




1355 



1360 



TGG CAA TGC CAA GTT CAA AAG ATC ACG CGC TCT CAT AGC ACA GAT ATT 4482 
Trp Gin Cys Gin Val Gin L*a He Thr Arg Ser Hia Ser Thr Asp He 
1365 1370 1375 

CCT TAC ATT GTG TCG GAA GCt GCA GTG CAA GCT GAG CAA AAA GAG CAG 4530 
Pro Tyr He Val Ser Glu Alk Ala Val Gin Ala Glu Gin Lya Glu Gin 
1380 1385 \ 1390 1395 

TTT GCA GAT ATG CAA GAT GAA\ CAC CAT GTC GCT GAA GCA ATT CCT CGA 4578 
Phe Ala Aep Met Gin Asp Glu \ His Hia Val Ala Glu Ala He Pro Arg 
1400 1 1405 1410 

ATC CCT CGC TTG TCC CTA ACC KtT ACT GAC AGA AAT GGG ATG GAA AAC 4626 
He Pro Arg Leu Ser Leu Thr lie Thr Asp Arg Asn Gly Met Glu Asn 
1415 1 1420 1425 

TTA CTG TCT GTG AAG CCA GAT dAA ACT TTG GGA TTC CCA TCT CTC AGG 4674 
Ser Val Lys Pro Asp din Thr Leu Gly Phe Pro Ser Leu Arg 
1430 l\^2S 1440 

AGT TTA CAT GGA CAT ctT AGG AAT GTG AAA TCC ATT CAG GGA 4722 
Ser Leu His Gly His Pio Arg Asn Val Lys Ser He Gin Gly 
1445 1450 \ 1455 

G TTA GAC AGA TCT GGA CAT GC4 AGT AGT GTA AGC AGC TTA GTA ATT 4770 
ys Leu Asp Arg Ser Gly Hia Aid Ser Ser Val Ser Ser Leu Val He 
1460 1465 \ 1470 1475 

GTG TCT GGA ATG ACA GCA GAA GAA \ AAA AAG GTT AAG AAA GAG AAA GCT 4818 
Val Ser Gly Met Thr Ala Glu Glu kya Lys Val Lys Lys Glu Lys Ala 
1480 I 1485 1490 

TCC ACA GAA ACT GAA TGC T AGTCTG^TTT GTTTCTTTAA XTTTTTTTTT 4867 
Ser Thr Glu Thr Glu Cys 
1495 



TAACAGTCAG AACCACTAAT GGGTGTCATC TTGGCCATCC TAAACATCCA TCCAATTTCC 
TAAAAACATT TTCCCTT 
(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS ^ 

(A) LENGTH: 1497 amino ^cids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



4927 
4944 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:7: 

Met Tyr He Arg Val Ser Tyr Asp Thr Ly\ Pro Asp Ser Leu Leu His 

15 idy 15 

Leu Met Val Lys Asp Trp Gin Leu Glu Leu ^ro Lys Leu Leu He Ser 
20 25 \ 30 



Val His Gly Gly Leu Gin Asn Phe Glu Met Gin Pro Lys Leu Lys Gin 



- 143 



35 



40 



45 



Val Phe Gly Lye Gly Leu lie Lya Ala Ala Met Thr Thr Gly Ala Trp 

50 A 60 

He Ph Thr Gly Gly vAl Ser Thr Gly Val He Ser His Val Gly Asp 
65 7iD 75 80 

Ala Leu Lys Asp His Ser\ser Lys Ser Arg Gly Arg Val Cys Ala lie 
85 \ 90 95 

Gly He Ala Pro Trp Gly lae Val Glu Asn Lys Glu Asp Leu Val Gly 
100 \ 105 110 

Lys Asp Val Thr Arg Val TyV Gin Thr Met Ser Asn Pro Leu Ser Lys 
115 \ 120 125 

Leu Ser Val Leu Asn Asn Ser\His Thr His Phe He Leu Ala Asp Asn 
130 135 \ 140 

eu Gly Lys Tyr Gly Ala Glu Val Lys Leu Arg Arg Leu Leu 
150 \ 155 160 

8 His He Ser Leu Gin Lyp He Asn Thr Arg Leu Gly Gin Gly 
165 \ 170 175 

1 Pro Leu Val Gly Leu Val Val\Glu Gly Gly Pro Asn Val Val Ser 
180 \l85 190 

He Val Leu Glu Tyr Leu Gin Glu Glu Pro Pro He Pro Val Val He 
195 200 \ 205 

Cys Asp Gly Ser Gly Arg Ala Ser Asto He Leu Ser Phe Ala His Lys 
210 215 \ 220 

Tyr Cys Glu Glu Gly Gly He He AsnVclu Ser Leu Arg Glu Gin Leu 
225 230 \ 235 240 

Leu Val Thr He Gin Lys Thr Phe Asn Tyr Asn Lys Ala Gin Ser His 
245 250 255 

Gin Leu Phe Ala He He Met Glu Cys Met Lys Lys Lys Glu Leu Val 

260 265 \ 270 

Thr Val Phe Arg Met Gly Ser Glu Gly Gln\Gln Asp He Glu Met Ala 
275 280 \ 285 

He Leu Thr Ala Leu Leu Lys Gly Thr Asn yal Ser Ala Pro Asp Gin 
290 295 \ 300 

Leu Ser Leu Ala Leu Ala Trp Asn Arg Val ^Lp He Ala Arg Ser Gin 
305 310 3I5 320 

He Phe Val Phe Gly Pro His Trp Thr Pro Ldu Gly Ser Leu Ala Pro 

325 330 1 335 

Pro Thr Asp Ser Lys Ala Thr Glu Lys Glu LyL Lys Pro Pro Met Ala 
340 345 \ 350 
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Thr Thr Lye Oly dlV Arg Gly Lys Gly Lys Gly Lys Lys Lys Gly Lys 

355 / \ 360 365 

Val Lys Glu Glu yal\ Glu Glu Glu Thr Asp Pro Arg Lys lie Glu Leu 

370 1 \ 375 380 



Leu Asn Trp Val Kan i^la Leu Glu Gin Ala Met Leu Asp Ala Leu Val 
385 \ 3,90 395 400 



Leu Asp Arg Val Asp Phe Val Lys Leu Leu lie Glu Asn Gly Val Asn 
405 \ 410 415 



Met Gin His Phe Leu Thi He Pro Arg Leu Glu Glu Leu Tyr Asn Thr 
420 \ 425 430 

Arg Leu Gly Pro Pro AsnlThr Leu His Leu Leu Val Arg Asp Val Lys 
435 \ 440 445 

Lys Ser Asn Leu Pro Pro Asp Tyr His He Ser Leu He Asp lie Gly 
450 455 460 



Leu Val Leu Glu Tyr Leu Met Gly Gly Ala Tyr Arg Cys Asn Tyr Thr 
465 470 \ 475 480 




Arg Lys Asn Phe Arg Thr LeuXTyr Asn Asn Leu Phe Gly Pro Lys Arg 
485 \ 490 495 

Ala Leu Lys Leu Leu dly Met Glu Asp Asp Glu Pro Pro Ala 
500 \ 505 510 

Lys a/iy Lys Lys Lys Lys Lys L^s Lys Lys Glu Glu Glu He Asp He 
515 52P 525 

p Val Asp Asp Pro Ala Val Ser\ Arg Phe Gin Tyr Pro Phe His Glu 
530 535 \ 540 

Leu Met Val Trp Ala Val Leu Met Lys Arg Gin Lys Met Ala Val Phe 
545 550 \ 555 560 

Leu Trp Gin Arg Gly Glu Glu Ser M^t Ala Lys Ala Leu Val Ala Cys 
565 \ 570 575 



Lys Leu Tyr Lys Ala Met Ala His G]\u Ser Ser Glu Ser Asp Leu Val 

580 58f5 590 

Asp Asp He Ser Gin Asp Leu Asp Aak Asn Ser Lys Asp Phe Gly Gin 
595 600 1 605 

Leu Ala Leu Glu Leu Leu Asp Gin Seri Tyr Lys His Asp Glu Gin He 
610 615 1 620 

Ala Met Lys Leu Leu Thr Tyr Glu Leul Lys Asn Trp Ser Asn Ser Thr 
625 630 I 635 640 

Cys Leu Lys Leu Ala Val Ala Ala LyslHis Arg Asp Phe He Ala His 
645 1650 655 



-\145 - 



Thr Cya Ser Gin Met' lieu L u Thr Asp Met Trp Met Gly Arg Leu Arg 



665 



670 



Met Arg Lya Asn Pro Gly Leu Lys Val He Met Gly He Leu Leu Pro 

o75 ^ * - 



680 



685 




I^S 4®" ^9 Thr Tyr Asp Asp Phe Ser Tyr 

690 \ 695 700 

Gin Thr Ser Lys Glu hin Glu Asp Gly Lys Glu Lys Glu Glu Glu Asn 

7lld 715 720 

Thr Asp Ala Asn Ala AbA Ala Gly Ser Arg Lys Gly Asp Glu Glu Asn 

725 1 730 735 

Glu His Lys Lys Gin Argllle He Pro He Gly Thr Lys He Cys Lvs 
740 1 745 750 

Phe Tyr Asn Ala Pro He k/al Lys Phe Trp Phe Tyr Thr He Ser Tyr 
755 1 760 755 

Leu Gly Tyr Leu Leu Leu ke Asn Tyr Val He Leu Val Arg Met Asp 

770 -^75 ^ 

4^ Ser Tyr He Val Ser 

790 \ 795 800 



Leu yAla Leu Glu Lys He Ar^ Glu He 
805 

feu Ser Gin Lys He Lys Val\Trp Leu 
820 \ 825 



Asp Leu Val Ala He Ser Thr Hhe Met 
835 ^0 

Ser Val Met Met He Gly Lys M^t Met 
850 855 

Val He Met Leu Val Val Leu Me^ Ser 
865 870 

He Leu His Pro Glu Glu Lys Pro\ser 
885 

Phe Tyr Met Pro Tyr Trp Met He \yr 
900 9to5 

He Asp Leu Tyr Ala Met Glu He Ai 
915 920 

Tyr Asp Glu Glu Gly Lys Arg Leu Pre 
930 935 

Leu Thr Pro Ala Leu Met Ala Cys Tyr 
945 950 



Leu Met Ser Glu Pro Gly Lya 
810 815 

Gin Glu Tyr Trp Asn He Thr 
830 

He Gly Ala Met Ala Thr Arg 
845 

He Asp Met Leu Tyr Phe Val 
860 

Phe Gly Val Ala Arg Gin Ala 
875 880 

Trp Lys Leu Ala Arg Asn He 
890 895 

Gly Glu Val Phe Ala Asp Gin 
910 

Pro Pro Cys Gly Glu Asn Leu 
925 

Pro Cys He Pro Gly Ala Trp 
940 

:.eu Leu Val Ala Asn He Leu 
955 960 
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Leu Val Aan Leu Le^ lie Ala Val Ph Asn Asn Thr Ph Phe Glu Val 

970 975 

Lys Ser lie Ser Asn\ Gin Val Trp Lys Phe Gin Arg Tyr Gin Leu lie 

980 \ 985 990 




Met Thr Phe His Asp Arg Pro Val Leu Pro Pro Pro Met He lie Leu 
995 \ 1000 1005 

Ser His He Tyr He I lie He Met Arg Leu Ser Gly Arg Cys Arg Lys 
1010 \ 1015 1020 

Lys Arg Glu Gly Asp Gl^ Glu Glu Arg Asp Arg Gly Leu Lys Leu Phe 
1025 1030 1035 1040 

Leu Ser Asp Glu Glu Led Lys Arg Leu His Glu Phe Glu Glu Gin Cys 
1045 1 1050 1055 

Val Gin Glu His Phe Argl Glu Lys Glu Asp Glu Gin Gin Ser Ser Ser 
1060 I 1065 1070 

u Arg He Arg Val \Thr Ser Glu Arg Val Glu Asn Met Ser Met 
1075 \ 1080 1085 

Leu Glu Glu He Asn ilu Arg Glu Thr Phe Met Lys Thr Ser Leu 
1090 li095 1100 

Gin Thr Val Asp Leu Arg l4u Ala Gin Leu Glu Glu Leu Ser Asn Arg 
1105 1110 \ 1115 1120 

Met Val Asn Ala Leu Glu Asii Leu Ala Gly He Asp Arg Ser Asp Leu 
1125 \ 1130 1135 

He Gin Ala Arg Ser Arg Ala ^er Ser Glu Cys Glu Ala Thr Tyr Leu 
1140 \ 1145 1150 

Leu Arg Gin Ser Ser He Asn sda: Ala Asp Gly Tyr Ser Leu Tyr Arg 
1155 lll&O 1165 

Tyr His Phe Asn Gly Glu Glu Le\A Leu Phe Glu Asp Thr Ser Leu Ser 
1170 1175 \ 1180 



Thr Ser Pro Gly Thr Gly Val Arg \Lys Lys Thr Cys Ser Phe Arg He 
1185 1190 \ 1195 1200 

Lys Glu Glu Lys Asp Val Lys Thr ALb Leu Val Pro Glu Cys Gin Asn 
1205 \ 1210 1215 

Ser Leu His Leu Ser Leu Gly Thr sAr Thr Ser Ala Thr Pro Asp Gly 
1220 121^5 1230 

Ser His Leu Ala Val Asp Asp Leu Ly^ Asn Ala Glu Glu Ser Lys Leu 
1235 1240 \ 1245 

Gly Pro Asp He Gly lie Ser Lys Glu\Asp Asp Glu Arg Gin Thr Asp 
1250 1255 \ 1260 
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Ser Lys hym Glu Glu Tnr He Ser Pro Ser Leu Aan Lys Thr Asp Val 
1265 12^70 1275 1280 

He His Gly Gin Asp Lys Ser Asp Val Gin Asn Thr Gin Leu Thr Val 
1285 \ 1290 1295 

Glu Thr Thr Asn He Glu Gly Thr He Ser Tyr Pro Leu Glu Glu Thr 
1300 y 1305 1310 

Lys He Thr Arg Tyr Phe pAo Asp Glu Thr He Asn Ala Cys Lys Thr 
1315 \ 1320 1325 

Met Lys Ser Arg Ser Phe Val Tyr Ser Arg Gly Arg Lys Leu Val Gly 
1330 1335 1340 

Gly Val Asn Gin Asp Val Glul Tyr Ser Ser He Thr Asp Gin Gin Leu 
1345 1350 \ 1355 1360 

Thr Thr Glu Trp Gin Cys Gin Wal Gin Lys He Thr Arg Ser His Ser 
1365 \ 1370 1375 

Thr Asp He Pro Tyr He Val Ser Glu Ala Ala Val Gin Ala Glu Gin 
1380 \ 1385 1390 

u Gin Phe Ala Asp Met GlVi Asp Glu His His Val Ala Glu Ala 
1395 1400 1405 

Pro Arg He Pro Arg Leu Ser\ Leu Thr He Thr Asp Arg Asn Gly 
1410 1415 \ 1420 



Met Glu Asn Leu Leu Ser Val Lys fro Asp Gin Thr Leu Gly Phe Pro 
1425 1430 I 1435 1440 

Ser Leu Arg Ser Lys Ser Leu His dly His Pro Arg Asn Val Lys Ser 
1445 \ 1450 1455 

He Gin Gly Lys Leu Asp Arg Ser Gly His Ala Ser Ser Val Ser Ser 
1460 14fe5 1470 



Leu Val He Val Ser Gly Met Thr Aid Glu Glu Lys Lys Val Lys Lys 
1475 1480 I 1485 

Glu Lys Ala Ser Thr Glu Thr Glu Cys 
1490 1495 

(2) INFORMATION FOR SEQ ID N0i8: 

(i) SEQUENCE CHARACTERISTICS! 

(A) LENGTH: 5055 base pairs\ 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLCXSY: linear 

(ii) MOLECULE TYPE: DNA 



(ix) FEATURE: 



(A) NAMB/KSY: CDS 

(B) LOCATIOKs 346. .4945 



(xi) SEQUENCE D^S\ 
ACTCATTATA GGGNTCGAGC 
CTGCTGTGGC AGGGTCAGAA 
TTTGTAATTC CTAGCACGAA 
CATATCCCCC CTCTGCCAAO 
GAGACTCAGC CTGAGAAATG 
TATGGAGTTC TTGAATTCCA 



IPTION: SEQ ID NO: 8: 
ICGCCCGG GCAGGTTTGA 
AtCTTGGATA GAGAAAACCT 
AGACTCTAAC AGGTGTTGCT 
TGCAACACCC AGCAAAAATG 
GTCtGTTGCC AAGCACACCC 
GGGTGGCGGA TATTCCAATA 



GCTGTGCCCT CTCCATTCCA 

TTTGCAAACG GGAATGTATC 

GTGGCCAGTT CACCAACCAG 

AAGAGGAAAG CAAACAGGTG 

AGAGCTACCC AACAGATTCC 

AAGCC ATG TAT ATC 
Met Tyr lie 
1 




CGT GTA TCC TAT GAC ACC AAGlCCA GAC TCA CTG 
Ser Tyr Asp Thr Lys \Pro Asp Ser Leu 
10 

TGG CAG CTG GAA CTC tCC AAG CTC TTA 
Trp Gin Leu Glu Leu Lys Leu Leu 

25 \ 30 



CAG AAC TTT GAG ATG CI 
Gin Asn Phe Glu Met 
40 



CCC AAG CTG 
Pro Lys Leu 
45 



CTG ATC AAG GOT GCT ATG ACC ACC GGG 
Leu lie Lys Ala Ala Met Thr Thr Gly 
55 60 



GGG GGT GTC AGC ACA GGT GTT ATC AGffi CAC GTA 
Gly Gly Val Ser Thr Gly Val He Sez\ His Val 
70 75 

GAC CAC TCC TCC AAG TCC AGA GGC CGG GTT TGT 
Asp His Ser Ser Lys Ser Arg Gly Arg v^l Cys 
85 90 

CCA TGG GGC ATC GTG GAG AAT AAG GAA GAb CTG 
Pro Trp Gly He Val Glu Asn Lys Glu Asb Leu 
100 105 \ 110 

ACA AGA GTG TAG CAG ACC ATG TCC AAC CCtIcTA 
Thr Arg Val Tyr Gin Thr Met Ser Asn Pro\Leu 
120 125 

CTC AAC AAC TCC CAC ACC CAC TTC ATC CTG bCT 
Leu Asn Asn Ser His Thr His Phe He Leu ^la 
135 140 



CTC CAT CTC ATG GTG 
Leu His Leu Met Val 
15 

ATA TCT GTG CAT GGA 
He Ser Val His Gly 
35 

AAA CAA GTC TTT GGG 
Lys Gin Val Phe Gly 
50 

GCC TGG ATC TTC ACC 
Ala Trp He Phe Thr 
65 

GGG GAT GCC TTG AAA 
Gly Asp Ala Leu Lys 

80 

GCT ATA GGA ATT GCT 
Ala He Gly He Ala 
95 

GTT GGA AAG GAT GTA 
Val Gly Lys Asp Val 
115 

AGT AAG CTC TCT GTG 
Ser Lys Leu Ser Val 
130 

GAC AAT GGC ACC CTG 
Asp Asn Gly Thr Leu 
145 



GGC AAG TAT GGC GCC GAG GTG AAG CTG CGA AGG CTG CTG GAA AAG CAC 



60 
120 
180 
240 
300 
354 

402 

450 

498 

546 

594 

642 

690 

738 

786 

834 



149 \- 



Gly Ly« Tyr Gly Ala 

150 



ATC TCC CTC CAO AAG A 
II Ser Leu Gin Lys II 
165 




4 



ilu Val Lys Leu Arg Arg Leu Leu Glu Lys His 

^ 155 160 



AAC ACA AGA CTG GGG CAG GGC GTG CCC CTC 
Asn Thr Arg Leu Gly Gin Gly Val Pro L u 
170 175 



GTG GGT CTC GTG GTG GAg\gGG GGC CCT AAC GTG GTG TCC ATC GTC TTG 
Val Gly Leu Val Val Glu bly Gly Pro Asn Val Val Ser He Val Leu 
180 185 \ 190 195 

GAA TAC CTG CAA GAA GAG CCT CCC ATC CCT GTG GTG ATT TGT GAT GGC 
Glu Tyr Leu Gin Glu Glu Plro Pro He Pro Val Val He Cys Asp Gly 
200 \ 205 210 




AGC GGA CGT GCC TCG GAC AT< 
Ser Gly Arg Ala Ser Asp lU 
215 



CTG TCC TTT GCG CAC AAG TAC TGT GAA 
Leu Ser Phe Ala His Lys Tyr Cys Glu 
220 225 



GGA ATA ATA AAT GAG \ TCC CTC AGG GAG CAG CTT CTA GTT ACC 
Gly He He Asn Glu Iser Leu Arg Glu Gin Leu Leu Val Thr 
230 1235 240 

AAA ACA TTT AAT TAT MAT AAG GCA CAA TCA CAT CAG CTG TTT 
Lys Thr Phe Asn Tyr Abn Lys Ala Gin Ser His Gin Leu Phe 
250 \ 255 



ATA ATG GAG TGC ATG AAi 
He Met Glu Cys Met Lys^ 
265 



AAG AAA GAA CTC GTC ACT GTG TTC 
Lys Lys Glu Leu Val Thr Val Phe 
270 275 



GGT TCT GAG GGC CAG CAG 
Gly Ser Glu Gly Gin Gin 
280 



feAC ATC GAG ATG GCA ATT TTA ACT 
^sp He Glu Met Ala He Leu Thr 
285 290 



GCC CTG CTG AAA GGA ACA AAC GTA T 
Ala Leu Leu Lys Gly Thr Asn Val Sei 
295 30 



GCT CCA GAT CAG CTG AGC TTG 
Ala Pro Asp Gin Leu Ser Leu 
305 



GCA CTG GCT TGG AAC CGC GTG GAC ATa\gCA CGA AGC CAG ATC TTT GTC 
Ala Leu Ala Trp Asn Arg Val Asp He ^la Arg Ser Gin He Phe Val 
310 315 \ 320 



TTT GGG CCC CAC TGG ACG CCC CTG GGA 
Phe Gly Pro His Trp Thr Pro Leu Gly 
325 330 

AGC AAA GCC ACG GAG AAG GAG AAG AAG CC 
Ser Lys Ala Thr Glu Lys Glu Lys Lys 
340 345 



CTG GCA CCC CCG ACG GAC 
Leu Ala Pro Pro Thr Asp 
335 

CCC ATG GCC ACC ACC AAG 
Pro Met Ala Thr Thr Lys 
350 355 



GGA GGA AGA GGA AAA GGG AAA GGC AAG AAg\ AAA GGG AAA GTG AAA GAG 
Gly Gly Arg Gly Lys Gly Lys Gly Lys Lys\Lys Gly Lys Val Lys Glu 
360 365 \ 370 



GAA GTG GAG GAA GAA ACT GAC CCC CGG AAG 
Glu Val Glu Glu Glu Thr Asp Pro Arg Lys 
375 380 



kTA GAG CTG CTG AAC TGG 
lie Glu Leu Leu Asn Trp 
385 



882 



930 



978 



1026 



1074 



1122 



1170 



1218 



1266 



1314 



1362 



1410 



1458 



1506 
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GTO AAX OCT TTO GAG CAAlGCG ATG CTA GAT GCT TTA GTC TTA GAT CGT 
Val Asn Ala Leu Glu Gln\Ala Met Leu Asp Ala L u Val Leu Aap Arg 

390 \ 395 400 



1554 



GTC GAC TTT GTO AAG CTC 
Val Asp Phe Val Lya Leu 
405 



ATT GAA AAC GGA GTG AAC ATG CAA CAC 
Leu He Glu Asn Gly Val Asn Met Gin His 
410 415 



1602 



TTT CTG ACC ATT CCG AGG 
Phe Leu Thr He Pro Arg Lei 
420 425 

CCA CCA AAC ACA CTT CAT CTC 
Pro Pro Asn Thr Leu His Lev 
440 



GAG GAG CTT TAT AAC ACA AGA CTG GGT 
Glu Glu Leu Tyr Asn Thr Arg Leu Gly 
430 435 

CTG GTG AGG GAT GTG AAA AAG AGC AAC 
Leu Val Arg Asp Val Lya Lys Ser Asn 
445 450 



CTT CCG CCT GAT TAC CAC ATC AGC CTC ATA GAC ATC GGG CTC GTG CTG 
Leu Pro Pro Asp Tyr His Ilel Ser Leu He Asp He Gly Leu Val Leu 
455 460 465 



1650 



1698 



1746 




GAG TAC CTC ATG GGA GGA GCC 
r Leu Met Gly Gly Ala 
470 



\TAC CGC TGC AAC TAC ACT CGG AAA AAC 
r Arg Cys Asn Tyr Thr Arg Lys Asn 



175 



TTT/CGG ACC CTT TAC AAC AAC 
Pha Arg Thr Leu Tyr Asn Asn L^ 
485 490 

CTT AAA CTT CTG GGA ATG GAA GA^ 
Leu Lys Leu Leu Gly Met Glu As{ 
500 505 

AAA AAA AAA AAA AAG AAA AAG GAG 
Lys Lys Lys Lys Lys Lys Lys Glu 
520 

GAC CCT GCC GTG AGT CGG TTC CAG 
Asp Pro Ala Val Ser Arg Phe Gin 
535 



480 

TTT GGA CCA AAG AGG CCT AAA GCT 
Phe Gly Pro Lys Arg Pro Lys Ala 
495 

GAT GAG CCT CCA GCT AAA GGG AAG 
Asp Glu Pro Pro Ala Lys Gly Lys 
510 515 

3AA GAG ATC GAC ATT GAT GTG GAC 
rlu Glu He Asp He Asp Val Asp 
525 530 

,T CCC TTC CAC GAG CTG ATG GTG 
[T Pro Phe His Glu Leu Met Val 
540 545 



1794 



1842 



1890 



1938 



1986 



TGG GCA GTG CTG ATG AAA CGC CAG AAA ATG GCA GTG TTC CTC TGG CAG 
Trp Ala Val Leu Met Lys Arg Gin Lys\ Met Ala Val Phe Leu Trp Gin 
550 555 \ 560 



2034 



CGA GGG GAA GAG AGC ATG GCC AAG GCC k:TG GTG GCC TGC AAG CTC TAC 
Arg Gly Glu Glu Ser Met Ala Lys Ala Leu Val Ala Cys Lys Leu Tyr 
565 570 \ 575 



2082 



AAG GCC ATG GCC CAC GAG TCC TCC GAG 
Lys Ala Met Ala His Glu Ser Ser Glu 
580 585 



;T GAT CTG GTG GAT GAC ATC 
ir Asp Leu Val Asp Asp He 
590 595 



2130 



TCC CAG GAC TTG GAT AAC AAT TCC AAA GAC 
Ser Gin Asp Leu Asp Asn Asn Ser Lys Asj 

600 605 



TTC GCC CAG CTT GCT TTG 
Phe Gly Gin Leu Ala Leu 
610 



2178 



GAG TTA TTA GAC CAG TCC TAT AAG CAT GAC\GAG CAG ATC GCT ATG AAA 
Glu Leu Leu Asp Gin Ser Tyr Lys His AspiGlu Gin He Ala Met Lys 



2226 
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• 



615 



620 



625 



CTC CTG ACC TAC GAG CTG\ AAA AAC TGG AGC AAC TOG ACC TGC CTC AAA 
Leu Leu Thr Tyr Glu LeulLya Asn Trp Ser Asn Ser Thr Cys Leu Lvs 
630 \ 635 640 

CTG GCC GTG GCA GCC AAA 
Leu Ala Val Ala Ala Lys 
645 



:AC CGG GAC TTC ATT GCT CAC ACC TGC AGC 
lis Arg Asp Phe He Ala His Thr Cys Ser 
650 655 



2274 



2322 



CAG ATG CTG CTG ACC GAT AT 
Gin Met Leu Leu Thr Asp 
660 665 

AAC CCC GGC CTG AAG GTT ATC 
Asn Pro Gly Leu Lys Val Il€ 
680 



TGG ATG GGA AGA CTG CGG ATG CGG AAG 
Trp Met Gly Arg Leu Arg Met Arg Lys 
670 675 

ATG GGG ATT CTT CTA CCC CCC ACC ATC 
Met Gly He Leu Leu Pro Pro Thr He 
685 690 



TTG TTT TTG GAA TTT CGC ACaItAT GAT GAT TTC TCG TAT CAA ACA TCC 
Leu Glu Phe Arg Thr \Tyr Asp Asp Phe Ser Tyr Gin Thr Ser 
695 \ 700 705 




AAG ISAA AAC GAG GAT GGC AAA 
Lys/Glu Asn Glu Asp Gly Lys 
710 

-T GCA GAT GCT GGC TCA AGA AA^ 
Ala Asp Ala Gly Ser Arg Lyt 
725 730 

AAA CAG AGA AGT ATT CCC ATC GGA 
Lys Gin Arg Ser He Pro He Gly 
740 745 

GCG CCC ATT GTC AAG TTC TGG TTT 
Ala Pro He Val Lys Phe Trp Phe 

760 ■ 



AAA GAA GAG GAA AAT ACG GAT GCA 
Lys Glu Glu Glu Asn Thr Asp Ala 
720 

GGG GAT GAG GAG AAC GAG CAT AAA 
Gly Asp Glu Glu Asn Glu His Lys 
735 

\CA AAG ATC TGT GAA TTC TAT AAC 
^hr Lys He Cys Glu Phe Tyr Asn 
750 755 

^C ACA ATA TCA TAC TTG GGC TAC 
\r Thr He Ser Tyr Leu Gly Tyr 
765 770 



CTG CTG CTG TTT AAC TAC GTC ATC CT<* GTG CGG ATG GAT GGC TGG CCG 
Leu Leu Leu Phe Asn Tyr Val He Levi Val Arg Met Asp Gly Trp Pro 
775 780\ 785 



TCC CTC CAG GAG TGG ATC GTC ATC TCC 
Ser Leu Gin Glu Trp He Val He Ser 
790 795 

GAG AAG ATA CGA GAG ATC CTC ATG TCA 
Glu Lys He Arg Glu He Leu Met Ser G^ 
805 810 

AAA ATC AAA GTT TGG CTT CAG GAG TAC 
Lys He Lys Val Trp Leu Gin Glu Tyr Tr^ 
820 825 



ATC GTG AGC CTG GCG TTA 
^r He Val Ser Leu Ala Leu 
800 

CCA GGC AAA CTC AGC CAG 
Pro Gly Lys Leu Ser Gin 
815 

AAC ATC ACA GAT CTC GTG 
Asn He Thr Asp Leu Val 
830 835 



GCC ATT TCC ACA TTC ATG ATT GGA GCA ATt\cTT CGC CTA CAG AAC CAG 
Ala He Ser Thr Phe Met He Gly Ala Ile\Leu Arg Leu Gin Asn Gin 
840 845 \ 850 



2370 



2418 



2466 



2514 



2562 



2610 



2658 



2706 



2754 



2802 



2850 



2898 
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CCC TAG ATO GGC TAT 
Pr Tyr Met Oly Tyr Glj 
855 



CGG GTG ATC TAG TGT GTG GAT ATG ATC TTG 
Arg Val lie Tyr Cya Val Asp He He Phe 
860 865 



TGG TAG ATG G6T GTG GTG ^AG ATC TTT GGT GTG AAG AAG TAT GTG GGG 
Trp Tyr He Arg Val Leu Asp lie Phe Gly Val Asn Lys Tyr Leu Gly 

870 \ .875 880 



2946 



2994 



GGA TAG GTG ATG ATG ATT GGA AAG ATG ATG ATG GAG ATG GTG TAG TTT 
Pro Tyr Val Met Met He Gly Lys Met Met He Asp Met Leu Tyr Phe 
885 BVO 895 

GTG GTG ATG ATG GTG GTG GI^Q 
Val Val He Met Leu Val vil 
900 905 



GGG ATT GTG CAT GGA GAG GAG 
Ala He Leu His Pro Glu G 
920 



ATG TTG TAG ATG CCC TAG TCtG 
He Phe Tyr Met Pro Tyr Trip 
935 




GTG ATG AGT TTG GGA GTA GGG GGT CAA 
Leu Met Ser Phe Gly Val Ala Arg Gin 
910 915 



AAG CGG TGT TGG AAA CTG GGG GGA AAG 
u Lys Pro Ser Trp Lys Leu Ala Arg Asn 
925 930 



ATG ATG TAT GGA GAG GTG TTT GGA GAG 
Met He Tyr Gly Glu Val Phe Ala Asp 
940 945 



GAG GTG TAG GGG ATG GAA ATT AAT CCT GGT TGT GGT GAG AAG 
e Asp Leu Tyr Ala Met\ Glu He Asn Pro Pro Cys Gly Glu Asn 
950 \955 960 



TAT GAT GAG GAG GGG AAG 
Tyr Asp Glu Glu Gly Lys 
965 970 



CTT CCT CCC TGT ATG GGG GGC GGG 
rg Leu Pro Pro Cys He Pro Gly Ala 
975 



TGG GTG AGT CCA GGA CTG ATG GC 
Trp Leu Thr Pro Ala Leu Met 
980 985 

GTG GTG GTG AAG CTG GTG ATT GGT 
Leu Leu Val Asn Leu Leu He Ala 
1000 



TGG TAT CTA CTG GTG GCG AAG ATG 
Cys Tyr Leu Leu Val Ala Asn He 
990 995 

^GTG TTG AAG AAT ACT TTG TTT GAA 
al Phe Asn Asn Thr Phe Phe Glu 
1005 1010 



GTA AAA TGA ATA TGG AAG GAG GTG T<te AAG TTG GAG GGA TAT GAG GTG 
Val Lys Ser He Ser Asn Gin Val Trb Lys Phe Gin Arg Tyr Gin Leu 
1015 lOiO 1025 

ATT ATG AGA TTT CAT GAG AGG CCA GTc\ CTG CCC CCA CGG ATG ATG ATT 
He Met Thr Phe His Asp Arg Pro Val \ Leu Pro Pro Pro Met He He 
1030 1035 \ 1040 



TTA AGG CAC ATG TAG ATG ATG ATT ATG 
Leu Ser His He Tyr He He He Met 
1045 1050 



CTG AGG GGG CGC TGG AGG 
eg Leu Ser Gly Arg Cys Arg 
1055 



3042 



3090 



3138 



3186 



3234 



3282 



3330 



3378 



3426 



3474 



3522 



AAA AAG AGA GAA GGG GAG CAA GAG GAA 
Lys Lys Arg Glu Gly Asp Gin Glu Glu 
1060 1065 

TTG CTT AGG GAG GAG GAG GTA AAG AGG CT^ 
Phe Leu Ser Asp Glu Glu Leu Lys Arg Lev 



GAT GGT GGA TTG AAG GTG 3570 
\g Asp Arg Gly Leu Lys Leu 
1070 1075 

CAT GAG TTG GAG GAG CAG 3618 
His Glu Phe Glu Glu Gin 
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• 



1080 

TGC GT6 CAO GAG CAC TTC 
Cys Val Gin Glu His Phe 
1095 

AGC GAG GAG CGC ATC CGG 
Ser Asp Glu Arg lie Arg 
1110 



1085 



1090 



GAG AAG GAG GAT GAG GAG GAG TCG TCC 
rg Glu Lys Glu Asp Glu Gin Gin Ser Ser 
1100 1105 

GtC ACT TCT GAA AGA GTT GAA AAT ATG TCA 
V41 Thr Ser Glu Arg Val Glu Asn Met Ser 
1115 1120 



ATG AGG TTG GAA GAA ATC AAT GAA AGA GAA ACT TTT ATG AAA ACT TCC 
Met Arg Leu Glu Glu lie AsA Glu Arg Glu Thr Phe Met Lya Thr Ser 
1125 llio 1135 

CTG CAG ACT GTT GAC CTT CGA\CTT GCT CAG CTA GAA GAA TTA TCT AAC 
Leu Gin Thr Val Asp Leu ArglLeu Ala Gin Leu Glu Glu Leu Ser Asn 
1140 1145 \ 1150 1155 




3X5 AAT GCT CTT GAA 
1 Asn Ala Leu Glu 
1160 



VT CTT GCG GGA ATC GAC AGG TCT GAC 
Lsn Leu Ala Gly lie Asp Arg Ser Asp 

1170 



CAG GCA CGG TCC CGG 
Gin Ala Arg Ser Arg hi 
1175 

CGG CAA AGC AGC ATC AA' 
Arg Gin Ser Ser lie As: 
1190 119' 

CAT TTT AAC GGA GAA GAG 
Phe Asn Gly Glu Glu 
1210 

TCC ACG TCA CCA GGG ACA GGA GTC 
Ser Thr Ser Pro Gly Thr Gly Val 
1220 1225 



ATA AAG GAA GAG AAG GAC GTG AAA AC 
He Lys Glu Glu Lys Asp Val Lys TM 
1240 



1165 



TCT TCT GAA TGT GAG GCA ACG TAT 
Ser Ser Glu Cys Glu Ala Thr Tyr 
1180 1185 

AGC GCT GAT GGC TAC AGC TTG TAT 
Ser Ala Asp Gly Tyr Ser Leu Tyr 
1200 

iTTA TTA TTT GAG GAT ACA TCT CTC 
eu Leu Phe Glu Asp Thr Ser Leu 
1215 



AAA AAA ACC TGT TCC TTC CGT 
Lys Lys Thr Cys Ser Phe Arg 
1230 1235 

CAC CTA GTC CCA GAA TGT CAG 
His Leu Val Pro Glu Cys Gin 
1245 1250 



AAC AGT CTT CAC CTT TCA CTG GGC ACA\ AGC ACA TCA GCA ACC CCA GAT 
Asn Ser Leu His Leu Ser Leu Gly Thr\ser Thr Ser Ala Thr Pro Asp 
1255 126a 1265 



3666 



3714 



3762 



3810 



3858 



3906 



3954 



4002 



4050 



4098 



4146 



GGC AGT CAC CTT GCA GTA GAT GAC TTA 
Gly Ser His Leu Ala Val Asp Asp Leu 
1270 1275 



\G AAC GCT GAA GAG TCA AAA 
Asn Ala Glu Glu Ser Lys 
1280 



TTA GGT CCA GAT ATT GGG ATT TCA AAG GAA GAT GAT GAA AGA CAG ACA 
Leu Gly Pro Asp He Gly He Ser Lys Glu Asp Asp Glu Arg Gin Thr 
1285 1290 1 1295 



4194 



4242 



GAC TCT AAA AAA GAA GAA ACT ATT TCC CC> 
Asp Ser Lys Lys Glu Glu Thr He Ser Pr 
1300 1305 



AGT TTA AAT AAA ACA GAT 
Ser Leu Asn Lys Thr Asp 
1310 1315 



4290 
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GTO ATA CAT GGA CAO GAC 
Val II HiB Gly Gin Asp 

1320 

GTG GAA ACG ACA AAT ATA 
Val Glu Thr Thr Asn II 
1335 



AAA TCA GAT GTT CAA AAC ACT CAG CTA ACA 
Lyo Ser Aap Val Gin Asn Thr Gin Leu Thr 
1325 1330 

3AA GGC ACT ATT TCC TAT CCC CTG GAA GAA 
Gly Thr lie Ser Tyr Pro Leu Glu Glu 
1340 1345 



ACC AAA ATT ACA CGC TAT T' 
Thr Lya lie Thr Arg Tyr PhV 
1350 



CCC GAT GAA ACG ATC AAT GCT TGT AAA 
Pro Asp Glu Thr lie Asn Ala Cys Lys 
1355 1360 



4338 



4386 



4434 



ACA ATG AAG TCC AGA AGC TTC GTC TAT TCC CGG GGA AGA AAG CTG GTC 
Thr Met Lys Ser Arg Ser Phe Val Tyr Ser Arg Gly Arg Lys Leu Val 
1365 13^0 1375 



4482 




GGT GGG GTT AAC CAG GAT GTA GAG TAC AGT TCA ATC ACG GAC CAG CAA 
Gly Gly-Val Asn Gin Asp Vail Glu Tyr Ser Ser lie Thr Asp Gin Gin 

1385 1 1390 1395 



ACG GAA TGG CAA TGC 
Thr Glu Trp Gin Cys 
1400 



ACA GAT ATT CCT TAC ATT 
Thr Asp lie Pro Tyr lie 
1415 



ICAA GTT CAA AAG ATC ACG CGC TCT CAT 
In Val Gin Lys He Thr Arg Ser His 
1405 1410 

\TG TCG GAA GCT GCA GTG CAA GCT GAG 
il Ser Glu Ala Ala Val Gin Ala Glu 
1420 1425 



CAA AAA GAG CAG TTT GCA GAT AO 
Gin Lys Glu Gin Phe Ala Asp Me^t 
1430 14] 



CAA GAT GAA CAC CAT GTC GCT GAA 
Gin Asp Glu His His Val Ala Glu 
5 1440 



GCA ATT CCT CGA ATC CCT CGC TTGl TCC CTA ACC ATT ACT GAC AGA AAT 
Ala He Pro Arg He Pro Arg Leu\ser Leu Thr He Thr Asp Arg Asn 
1445 1450 \ 1455 



4530 



4578 



4626 



4674 



4722 



GGG ATG GAA AAC TTA CTG TCT GTG 
Gly Met Glu Asn Leu Leu Ser Val 
1460 1465 

CCA TCT CTC AGG TCA AAA AGT TTA 
Pro Ser Leu Arg Ser Lys Ser Leu 
1480 



\G CCA GAT CAA ACT TTG GGA TTC 
fs Pro Asp Gin Thr Leu Gly Phe 
1470 1475 

IT GGA CAT CCT AGG AAT GTG AAA 
Hilp Gly His Pro Arg Asn Val Lys 
1485 1490 



TCC ATT CAG GGA AAG TTA GAC AGA TCt GGA CAT GCC AGT AGT GTA AGC 
Ser He Gin Gly Lys Leu Asp Arg Serl Gly His Ala Ser Ser Val Ser 
1495 1500 1505 



AGC TTA GTA ATT GTG TCT GGA ATG ACA 
Ser Leu Val He Val Ser Gly Met Thr 
1510 1515 

AAA GAG AAA GCT TCC ACA GAA ACT GAA 
Lys Glu Lys Ala Ser Thr Glu Thr Glu 
1525 1530 



[GCA 



GAA GAA AAA AAG GTT AAG 
la Glu Glu Lys Lys Val Lys 
1520 



AGTCTGTTTT GTTTCTTTAA 



4770 



4818 



4866 



4914 



4965 



lys 



TTTTTTTTTT TAACAGTCAG AAACCCACTA ATGGftTGTCA TCTTGGCCCA TCCTAAACAC 5025 
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ATMTCCAATT TCCTAAAAAC ATTTTCCCTT 



5055 



(2) INFORMATION FOR SSQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1533 amino acids 

(B) TYPE: am^no acid 
(D) TOPOLOGY :\ linear 

(ii) MOLECULE TYPE: ptotein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

Met Tyr He Arg Val Ser Tyr Wap Thr Lys Pro Asp 
1 5 \ 10 

Leu Met Val Lys Asp Trp Gin ijeu Glu Leu Pro Lys 
20 \ 25 



ly Gly Leu Gin Asn Phte Glu Met Gin Pro 
35 



Ser Leu Leu His 
15 

Leu Leu He Ser 
30 

Lys Leu Lys Gin 
45 



e Gly Lys Gly Leu He Lys\Ala Ala Met Thr Thr Gly Ala Trp 
50 55 \ 60 



e Phe Thr Gly Gly Val Ser Thr Gly Val He Ser 
65 70 \ 75 

Ala Leu Lys Asp His Ser Ser Lys der Arg Gly Arg 

85 \ 90 

Gly He Ala Pro Trp Gly He Val gIu Asn Lys Glu 
100 ici5 

Lys Asp Val Thr Arg Val Tyr Gin Tht Met Ser Asn 
115 120 



His Val Gly Asp 
80 

Val Cys Ala He 
95 

Asp Leu Val Gly 
110 

Pro Leu Ser Lys 
125 



Leu Ser Val Leu Asn Asn Ser His Thr\His Phe He Leu Ala Asp Asn 
130 135 \ 140 



Gly Thr Leu Gly Lys Tyr Gly Ala Glu val Lys Leu 
145 150 \ 155 

Glu Lys His He Ser Leu Gin Lys He l^n Thr Arg 
165 1170 

Val Pro Leu Val Gly Leu Val Val Glu G^y Gly Pro 
180 185 

He Val Leu Glu Tyr Leu Gin Glu Glu Prjo Pro He 
195 200 



Cys Asp Gly Ser Gly Arg Ala Ser Asp Hi» Leu Ser Phe Ala His Lys 



210 



215 



Arg Arg Leu Leu 
160 

Leu Gly Gin Gly 
175 

Asn Val Val Ser 
190 

Pro Val Val He 
205 



Tyr Cys Glu Glu Gly Gly He He Asn Gl\, 
225 230 



220 

Ser Leu 
235 



Arg Glu Gin Leu 
240 
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Leu Val Thr lie 61n LyW Thr Phe 
245 

Gin Leu Phe Ala lie 11^ Met Glu 
260 

Thr Val Phe Arg Met Gly \ser Glu 
275 \ 280 

lie Leu Thr Ala Leu Leu iWs Gly 
290 2^5 

Leu Ser Leu Ala Leu Ala T^ Asn 

305 310 



Val Phe Gly Pro Kite Trp 
325 



Asp Ser Lys Ala Thi\ Glu 
340 

Lys Gly Gly Arg Gly tys 
55 560 

Glu Glu Val Glu Glu dilu 
375 



Asn Tyr Asn 

250 

Cys Met Lys 
265 

Gly Gin Gin 

Thr Asn Val 

Arg Val Asp 
315 

Thr Pro Leu 
330 

Lys Glu Lys 
345 

Gly Lys Gly 
Thr Asp Pro 



Lys Ala 

Lys Lys 

Asp lie 
285 

Ser Ala 

300 

He Ala 
Gly Ser 
Lys Pro 



Lys Lys 
365 

Arg Lys 
380 



Gin Ser His 

255 

Glu Leu Val 
270 

Glu Met Ala 

Pro Asp Gin 

Arg Ser Gin 
320 

Leu Ala Pro 
335 

Pro Met Ala 

350 

Lys Gly Lys 
He Glu Leu 



Trp Val Asn Ala Leu Glu Gin 
390 

Arg Val Asp Phe Val LyW Leu 
405 



Met Gin His Phe Leu Thr He Prd Arg 
420 \425 

Arg Leu Gly Pro Pro Asn Thr Leu ^is 
435 440 

Lys Ser Asn Leu Pro Pro Asp Tyr ri^is 
450 455 

Leu Val Leu Glu Tyr Leu Met Gly Gl^ 
465 470 



Arg Lys Asn Phe Arg Thr Leu Tyr 
485 



Asn\ 



Ala Met Leu Asp Ala Leu Val 
395 400 

Leu He Glu Asn Gly Val Asn 
410 415 

Leu Glu Glu Leu Tyr Asn Thr 
430 

Leu Leu Val Arg Asp Val Lys 
445 

He Ser Leu He Asp He Gly 
460 

Ala Tyr Arg Cys Asn Tyr Thr 
475 480 

Asn Leu Phe Gly Pro Lys Arg 
490 495 



Pro Lys Ala Leu Lys Leu Leu Gly Met 
500 505 

Lys Gly Lys Lys Lys Lys Lys Lys Lys 
515 520 

Asp Val Asp Asp Pro Ala Val Ser Arg 

530 535 



\G1u Asp Asp Glu Pro Pro Ala 
510 



Lys Glu Glu Glu He Asp He 
525 

le Gin Tyr Pro Phe His Glu 
540 
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# 



Leu Mot Val Trp Ala Val Ltfu Met Lys Arg Gin Lys Met Ala Val Phe 

545 550 \ 555 560 

L u Trp Gin Arg Gly Glu GlA Ser Met Ala Lys Ala Leu Val Ala Cys 
565 \ 570 575 

Lys Leu Tyr Lys Ala Met Ala\His Glu Ser Ser Glu Ser Asp Leu Val 
580 \ 585 590 

Asp Asp lie Ser Gin Asp Leu Wsp Asn Asn Ser Lys Asp Phe Gly Gin 

595 1600 605 

Leu Ala Leu Glu Leu Leu Asp bin Ser Tyr Lys His Asp Glu Gin He 
610 615 1 620 

Ala Met Lys Leu Leu Thr Tyr dlu Leu Lys Asn Trp Ser Asn Ser Thr 
625 630 \ 635 640 

8 Leu Ala Val Ala Ala Lys His Arg Asp Phe He Ala His 
645 \ 650 655 

Thr Cy^ Ser Gin Met Leu Leu Th)f Asp Met Trp Met Gly Arg Leu Arg 
/ 660 \ 665 670 

Arg Lys Asn Pro Gly Leu Ly8\val He Met Gly He Leu Leu Pro 
675 680\ 685 




Pro Thr He Leu Phe Leu Glu Phe lArg Thr Tyr Asp Asp Phe Ser Tyr 
690 695 1 700 

Gin Thr Ser Lys Glu Asn Glu Asp tly Lys Glu Lys Glu Glu Glu Asn 
705 710 \ 715 720 



Thr Asp 

Glu His 

Phe Tyr 

Leu Gly 
770 

Gly Trp 
785 

Leu Ala 
Leu Ser 
Asp Leu 



Ala Asn Ala Asp 

725 

Lys Lys Gin Arg 
740 

Asn Ala Pro He 
755 

Tyr Leu Leu Leu 



Ala Gly Ser Arg 
730 

Ser He Pr^ He 

74! 

Val Lys Phe\Trp 
760 

Phe Asn Tyr Vfil 
775 



Pro Ser Leu Gin Glu Trp He Ve 
790 



Lys Gly 
Gly Thr 
Phe Tyr 



He Leu 
780 

He Ser 

795 



Asp Glu Glu Asn 
735 

Lys He Cys Glu 
750 

Thr He Ser Tyr 
765 

Val Arg Met Asp 



Tyr He Val Ser 
800 



Leu Glu Lys He 
805 

Gin Lys He Lys 
820 

Val Ala He Ser 
835 



Arg Glu He Let 
81C 

Val Trp Leu Gin' 
825 

Thr Phe Met He 
840 



Met Ser Glu Pro Gly Lys 
815 

Glu Tyr Trp Asn He Thr 
830 

iGly Ala He Leu Arg Leu 
845 
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# 




Gin Aan Gin Pro Tyr Mot Gly Tyr Gly Arg Val lie Tyr Cya Val Asp 
850 I 855 860 

lie He Ph Trp Tyr Ile\Arg Val Leu Asp He Phe Gly Val Asn Lys 
865 870\ 875 880 

Tyr Leu Gly Pro Tyr Val Met Met He Gly Lys Met Met He Asp Met 
885 \ 890 895 

Leu Tyr Phe Val Val He MAt Leu Val Val Leu Met Ser Phe Gly Val 
900 \ 905 910 

Ala Arg Gin Ala He Leu His Pro Glu Glu Lys Pro Ser Trp Lys Leu 
915 \ 920 925 

Ala Arg Asn He Phe Tyr Meti Pro Tyr Trp Met He Tyr Gly Glu Val 
930 935\ 940 

Phe Ala Asp Gin He Asp Leu\Tyr Ala Met Glu He Asn Pro Pro Cys 

945^....^ ) 950 \ 955 960 

Asn Leu Tyr Asp Glu dlu Gly Lys Arg Leu Pro Pro Cys He 
965 \ 970 975 



Ala Trp Leu Thr Pro 
980 



Al^ Leu Met Ala Cys Tyr Leu Leu Val 
985 990 



He Leu Leu Val Asn Leu\ Leu He Ala Val Phe Asn Asn Thr 
995 1000 1005 

Phe Glu Val Lys Ser He Ser Asn Gin Val Trp Lys Phe Gin Arg 
1010 1015 \ 1020 

Tyr Gin Leu He Met Thr Phe His Mp Arg Pro Val Leu Pro Pro Pro 
1025 1030 \ 1035 1040 

Met He He Leu Ser His He Tyr iVe He He Met Arg Leu Ser Gly 
1045 \ 1050 1055 



Arg Cys Arg Lys Lys Arg Glu Gly AspvGln Glu Glu Arg Asp Arg Gly 
1060 1065\ 1070 

Leu Lys Leu Phe Leu Ser Asp Glu Glu veu Lys Arg Leu His Glu Phe 
1075 1080 \ 1085 



Glu Glu Gin Cys Val Gin Glu His Phe 
1090 1095 



Glu Lys Glu Asp Glu Gin 
1100 



Gin Ser Ser Ser Asp Glu Arg He Arg Val\ Thr Ser Glu Arg Val Glu 
1105 1110 \ 1115 1120 

Asn Met Ser Met Arg Leu Glu Glu He AsnlGlu Arg Glu Thr Phe Met 
1125 1130 1135 

Lys Thr Ser Leu Gin Thr Val Asp Leu Arg Leu Ala Gin Leu Glu Glu 
1140 1145 \ 1150 
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• 




Leu Ser Asn Arg Mat ValXAsn Ala Leu Glu Asn Leu Ala Gly He Asp 
1155 \ 1160 1165 

Arg Ser Asp Leu He In Ala Arg Ser Arg Ala Ser Ser Glu Cys Glu 
1170 l\l75 1180 

Ala Thr Tyr Leu Leu Arg Gin Ser Ser He Asn Ser Ala Asp Gly Tyr 
1185 1190 \ 1195 1200 

Ser Leu Tyr Arg Tyr His Phei Asn Gly Glu Glu Leu Leu Phe Glu Asp 
1205 \ 1210 1215 

Thr Ser Leu Ser Thr Ser Pro \Gly Thr Gly Val Arg Lys Lys Thr Cys 
1220 I 1225 1230 

Ser Phe Arg He Lys Glu Glu Lys Asp Val Lys Thr His Leu Val Pro 
1235 1240 1245 

Glxi^jefB^ln Asn Ser Leu His Leu Ser Leu Gly Thr Ser Thr Ser Ala 
1250/ 1255 \ 1260 

Thr Pr6 Asp Gly Ser His Leu A Ik Val Asp Asp Leu Lys Asn Ala Glu 
1265y 1270 \ 1275 1280 

Glii Ser Lys Leu Gly Pro Asp Ile\Gly He Ser Lys Glu Asp Asp Glu 
1285 I 1290 1295 



Arg Gin Thr Asp Ser Lys Lys Glu Glu Thr He Ser Pro Ser Leu Asn 
1300 1305 1310 

Lys Thr Asp Val He His Gly Gin AWp Lys Ser Asp Val Gin Asn Thr 
1315 1320 \ 1325 

Gin Leu Thr Val Glu Thr Thr Asn IlW Glu Gly Thr He Ser Tyr Pro 
1330 1335 \ 1340 

Leu Glu Glu Thr Lys He Thr Arg Tyr\ Phe Pro Asp Glu Thr He Asn 
1345 1350 \ 1355 1360 

Ala Cys Lys Thr Met Lys Ser Arg Ser Phe Val Tyr Ser Arg Gly Arg 
1365 1370 1375 

Lys Leu Val Gly Gly Val Asn Gin Asp Val Glu Tyr Ser Ser He Thr 
1380 1385 \ 1390 



Asp Gin Gin Leu Thr Thr Glu Trp Gin Cyi Gin Val Gin Lys He Thr 
1395 1400 \ 1405 

Arg Ser His Ser Thr Asp He Pro Tyr IlelVal Ser Glu Ala Ala Val 
1410 1415 \ 1420 

Gin Ala Glu Gin Lys Glu Gin Phe Ala Asp ket Gin Asp Glu His His 
1425 1430 1435 1440 



Val Ala Glu Ala He Pro Arg He Pro Arg Leu Ser Leu Thr He Thr 
1445 1450 1455 
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Aap Arg Aan Gly Met Glu Asn Leu Leu Ser Val Lys Pro Asp Gin Thr 
1460 \ 1465 1470 

Leu Gly Phe Pro SerlLeu Arg Ser Lya Ser Leu His Gly His Pro Aro 
1475 \ 1480 1485 

Asn Val Lya Ser He iln Gly Lys Leu Asp Arg Ser Gly His Ala Ser 
1490 \ 1495 1500 

Ser Val Ser Ser Leu wkl He Val Ser Gly Met Thr Ala Glu Glu Lya 
1505 ISHO 1515 1520 

Lya Val Lya Lya Glu Lyk Ala Ser Thr Glu Thr Glu Cya 
1525 \ 1530 



174591.B11 



